Preface

In both classical and quantum mechanics, the Lagrangian and Hamiltonian formalisms play a central role. They are powerful 
tools that can be used to analyze the behavior of a vast class of systems, ranging from the motion of a single particle in a static 
potential field to complex many-body systems featuring a strong time-dependence. 

The aim of this book is to provide an introduction to the Lagrangian and Hamiltonian formalisms in classical systems. We 
will cover both non-relativistic and relativistic systems. This presentation is prepared with an undergraduate audience in mind, 
typically a student at the end of the first or beginning of the second year. In addition to explaining the underlying theory in a 
detailed manner, we shall also provide a number of examples that will illustrate the formalisms "in action".

These lecture notes are primarily based on the teaching of I.B. and also follow, to some extent, the structure of the excellent
textbook "Classical Mechanics" by Goldstein et al. We have also included some examples not found in Goldstein, inspired by
instructive examples found in other lecture notes, all of which have been properly cited where they appear. Special thanks also go
to Jon Andreas Støvneng and Simen Ellingsen for their contributions to the lecture notes in this course over the years.

The lectures in this course, given by J.L., have been recorded on video and uploaded to YouTube. Thus, at the beginning
of each chapter we provide a link to the YouTube-videos covering that particular chapter. Here is the complete playlist of 
YouTube-videos covering all topics in this book. 

It is our goal that students who study this material afterwards will find themselves well prepared to dig deeper into the 
remarkable world of theoretical physics at a more advanced level. We have carefully chosen the topics of this book to make 
students proficient in using and understanding important concepts such as symmetries and conservation laws, the special theory 
of relativity, and the Lagrange/Hamilton equations. 

We welcome feedback on the book (including any typos that you may find, although we have endeavored to eliminate as many 
of them as possible), and hope that you will have an exciting time reading it! 


Jacob Linder (jacob.linder@ntnu.no) and Iver H. Brevik (iver.h.brevik@ntnu.no) 
Norwegian University of Science and Technology 
Trondheim, Norway 
I. FUNDAMENTAL PRINCIPLES

Youtube-videos. 01-04 in this playlist. 

Learning goals. After reading this chapter, the student should: 

• Know how to construct the Lagrange function for a system 

• Be able to write down and solve Lagrange’s equations 

• Know how to incorporate friction in the Lagrangian formalism 


A. Notation and brief repetition 


To begin with, let us establish some notation for the various physical quantities that will appear throughout this book. The 
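velocity vector is denoted $\mathbf{v} = d\mathbf{r}/dt$, linear momentum $\mathbf{p} = m\mathbf{v}$, force $\mathbf{F}$, angular momentum $\mathbf{L} = \mathbf{r} \times \mathbf{p}$, and torque
$\boldsymbol{\tau} = \mathbf{r} \times \mathbf{F}$. From basic mechanics, we know that Newton’s 2nd law reads: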


\[
\mathbf{F} = \frac{d\mathbf{p}}{dt} \tag{1.1}
\]


and is valid in inertial systems. For now, we may think of an inertial system as a non-accelerated system, meaning that objects 
will move in straight lines at constant velocity unless acted upon by some force. In the special case of a constant mass m, the 
2nd law may be written as F = ma where a is the acceleration. 


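There is also an "equivalent" of Newton’s 2nd law which is useful for rotational motion. Consider the torque $\boldsymbol{\tau}$:

\[
\boldsymbol{\tau} = \mathbf{r} \times \frac{d\mathbf{p}}{dt}
= \frac{d}{dt}(\mathbf{r} \times \mathbf{p}) - \frac{d\mathbf{r}}{dt} \times \mathbf{p}
= \frac{d\mathbf{L}}{dt} - \mathbf{v} \times m\mathbf{v}
= \frac{d\mathbf{L}}{dt}. \tag{1.2}
\]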


It follows from equation (1.1) that if $\mathbf{F} = 0$, then $\mathbf{p}$ is conserved (time-independent). Similarly, it follows from equation (1.2)
that if $\boldsymbol{\tau} = 0$, then $\mathbf{L}$ is conserved. We will have much more to say about such conservation laws later on in this book. As a final
preliminary, we also briefly remind the reader of the meaning of a conservative system. The work done by a force $\mathbf{F}$ on a particle
moving from point 1 to 2 is defined as:


\[
W_{12} = \int_1^2 \mathbf{F} \cdot d\mathbf{s} \tag{1.3}
\]

where $d\mathbf{s}$ is an infinitesimal displacement along the trajectory of the particle. If we for simplicity assume that m is constant, we
get

\[
\int \mathbf{F} \cdot d\mathbf{s} = m \int \frac{d\mathbf{v}}{dt} \cdot \mathbf{v}\, dt = \frac{m}{2} \int \frac{d}{dt}\left(v^2\right) dt. \tag{1.4}
\]





The result is then $W_{12} = \frac{m}{2}(v_2^2 - v_1^2) = T_2 - T_1$. In effect, the work performed on the particle equals the change in the kinetic
energy of the particle. The system is said to be conservative if the work performed between points 1 and 2 is independent of
which path one takes between them. Put mathematically, we would write that: 


\[
\oint \mathbf{F} \cdot d\mathbf{s} = 0, \tag{1.5}
\]
which implies that the force can be written as $\mathbf{F} = -\nabla V(\mathbf{r})$. Here, V is the potential energy, which can depend on the position
$\mathbf{r}$. Note that we can always choose the reference level of zero energy for the potential energy as we please, because adding a
constant $V_0$ to V does not change the physical force: $\mathbf{F} = -\nabla[V(\mathbf{r}) + V_0] = -\nabla V(\mathbf{r})$ since $\nabla V_0 = 0$. We mention in passing
that a system including friction cannot possibly be conservative: friction always opposes the motion, so the net work it does over a closed
trajectory starting at point 1 and ending up at the same point 1 is strictly negative rather than zero, in contrast to equation (1.5).
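
The closed-loop criterion (1.5) can be checked numerically. The following is a minimal sketch, not part of the original notes: the example potential V = k(x^2 + y^2)/2, the drag-like force F = -c v, the unit-circle path, and all function names are assumptions chosen purely for illustration.

```python
import math

def loop_work(force, n=20000):
    """Approximate the closed-loop integral of F . ds around the unit circle.

    `force(x, y, tx, ty)` returns (Fx, Fy) at position (x, y) for a particle
    moving along the unit tangent (tx, ty). Midpoint rule with n segments.
    """
    total = 0.0
    dphi = 2.0 * math.pi / n
    for i in range(n):
        phi = (i + 0.5) * dphi
        x, y = math.cos(phi), math.sin(phi)
        tx, ty = -math.sin(phi), math.cos(phi)  # counter-clockwise unit tangent
        fx, fy = force(x, y, tx, ty)
        total += (fx * tx + fy * ty) * dphi     # |ds| = dphi on a unit circle
    return total

def spring_force(x, y, tx, ty, k=2.0):
    # Conservative: F = -grad V with V = k (x^2 + y^2) / 2 (assumed example)
    return (-k * x, -k * y)

def drag_force(x, y, tx, ty, c=0.5, speed=1.0):
    # Friction-like: F = -c v, always opposing the motion (assumed model)
    return (-c * speed * tx, -c * speed * ty)

w_spring = loop_work(spring_force)  # vanishes for the conservative force
w_drag = loop_work(drag_force)      # equals -c * speed * 2 * pi, i.e. nonzero
```

For the gradient force the loop integral vanishes, whereas the drag force does a net nonzero (negative) amount of work over the closed path, so it cannot be derived from a potential.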

At this stage, we have two expressions for the work performed on a particle moving from point 1 to 2:

\[
W_{12} = T_2 - T_1 = V_1 - V_2 \;\Rightarrow\; T_1 + V_1 = T_2 + V_2. \tag{1.6}
\]


In other words: 


The total energy T + V is constant for a conservative system 


B. Many-particle systems 

In a system with many particles, Newton’s 2nd law must take into account both external forces and all forces from interactions 
in the system. For particle i, we get: 


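\[
\mathbf{F}_i^{\text{ext}} + \sum_j \mathbf{F}_{ji} = \dot{\mathbf{p}}_i. \tag{1.7}
\]

Here, $\mathbf{F}_i^{\text{ext}}$ is the external force acting on particle i while $\mathbf{F}_{ji}$ is the internal force acting on particle i due to particle j. Assuming
that $\mathbf{F}_{ji}$ satisfies Newton’s 3rd law, it has to be equal in magnitude but opposite in direction to $\mathbf{F}_{ij}$. Thus, we get $\mathbf{F}_{ij} = -\mathbf{F}_{ji}$.
Using this and performing a summation over all particles i, equation (1.7) becomes:

\[
\frac{d^2}{dt^2} \sum_i m_i \mathbf{r}_i
= \underbrace{\sum_i \mathbf{F}_i^{\text{ext}}}_{\text{Total external force } \mathbf{F}^{\text{ext}}}
+ \underbrace{\sum_{i,j;\, i \neq j} \mathbf{F}_{ji}}_{=\,0}. \tag{1.8}
\]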


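We define the center of mass (CM) position $\mathbf{R}$:

\[
\mathbf{R} = \frac{\sum_i m_i \mathbf{r}_i}{\sum_i m_i} = \frac{\sum_i m_i \mathbf{r}_i}{M}. \tag{1.9}
\]

Using $\mathbf{R}$, Newton’s 2nd law has now been cast in the form:

\[
M \frac{d^2 \mathbf{R}}{dt^2} = \mathbf{F}^{\text{ext}}. \tag{1.10}
\]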


Physically, this means that the center of mass of the many-particle system moves as if all the mass was concentrated in the CM 
position. Moreover, since the total momentum is 


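\[
\mathbf{P} = \sum_i m_i \frac{d\mathbf{r}_i}{dt} = M \frac{d\mathbf{R}}{dt}, \tag{1.11}
\]

we see that if the total external force $\mathbf{F}^{\text{ext}}$ is zero, then the total momentum $\mathbf{P}$ is conserved. This result relies on
$\mathbf{F}_{ij} = -\mathbf{F}_{ji}$, which is called the weak law of action and reaction.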


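The same line of reasoning may be applied to the total angular momentum $\mathbf{L} = \sum_i \mathbf{r}_i \times \mathbf{p}_i$ of the system. Performing a
differentiation with respect to time and inserting equation (1.7) into this equation, we obtain

\[
\dot{\mathbf{L}} = \sum_i \mathbf{r}_i \times \mathbf{F}_i^{\text{ext}} + \sum_{i,j;\, i \neq j} \mathbf{r}_i \times \mathbf{F}_{ji}. \tag{1.12}
\]

The last term can be written as a summation over pairs of the form:

\[
\mathbf{r}_i \times \mathbf{F}_{ji} + \mathbf{r}_j \times \mathbf{F}_{ij} = (\mathbf{r}_i - \mathbf{r}_j) \times \mathbf{F}_{ji}, \tag{1.13}
\]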
where we used that $\mathbf{F}_{ij} = -\mathbf{F}_{ji}$. Defining $\mathbf{r}_{ij} = \mathbf{r}_i - \mathbf{r}_j$, we may then write

\[
\dot{\mathbf{L}} = \boldsymbol{\tau}^{\text{ext}} + \frac{1}{2} \sum_{i,j;\, i \neq j} \mathbf{r}_{ij} \times \mathbf{F}_{ji}. \tag{1.14}
\]

The benefit of doing so is clear by inspecting the last term: if the force between particles i and j lies along the line which
connects i and j (this is the strong law of action and reaction), then $\mathbf{r}_{ij} \times \mathbf{F}_{ji} = 0$ for all i and j. We end up with:

\[
\dot{\mathbf{L}} = \boldsymbol{\tau}^{\text{ext}}. \tag{1.15}
\]

We see that in the same way as for the total linear momentum, the total angular momentum $\mathbf{L}$ for a system of many particles is
conserved if the net external torque $\boldsymbol{\tau}^{\text{ext}}$ is zero. We emphasize that it was crucial for this derivation that the forces were central,
meaning that forces act along the line connecting any pair of particles.

It is also instructive to rewrite the total angular momentum $\mathbf{L}$ in a slightly different way, which brings out the contribution to
$\mathbf{L}$ both from the CM motion and the relative motion around the CM. The coordinate vector to any particle i may be written as
$\mathbf{r}_i = \mathbf{R} + \mathbf{r}'_i$ where $\mathbf{r}'_i$ is the position of particle i relative to the CM. In the same way, $\mathbf{v}_i = \mathbf{V} + \mathbf{v}'_i$. We obtain:

\[
\mathbf{L} = \sum_i \mathbf{r}_i \times \mathbf{p}_i
= \sum_i \mathbf{R} \times m_i \mathbf{V}
+ \sum_i \mathbf{r}'_i \times m_i \mathbf{v}'_i
+ \Big( \sum_i m_i \mathbf{r}'_i \Big) \times \mathbf{V}
+ \mathbf{R} \times \frac{d}{dt} \sum_i m_i \mathbf{r}'_i. \tag{1.16}
\]

By considering the definition of the CM position $\mathbf{R}$ and the relative coordinate $\mathbf{r}'_i$, one finds that $\sum_i m_i \mathbf{r}'_i = 0$. Thus, the two
last terms in the above equation vanish and we are left with

\[
\mathbf{L} = \mathbf{R} \times M\mathbf{V} + \sum_i \mathbf{r}'_i \times \mathbf{p}'_i, \qquad \mathbf{p}'_i = m_i \mathbf{v}'_i. \tag{1.17}
\]

In other words, the total angular momentum around the origin is equal to the angular momentum of an object positioned in the
CM with the total mass M of the system plus the angular momentum around the CM itself. We see that if the CM is stationary,
$\mathbf{R}$ is constant and thus $\mathbf{V} = 0$, meaning that $\mathbf{L}$ is equal to the angular momentum around the CM.

The kinetic energy of a many-particle system can also be split up in the same way: a part pertaining to the
motion of the CM and a part pertaining to the relative motion around the CM. We obtain:

\[
T = \frac{1}{2} \sum_i m_i v_i^2
= \frac{1}{2} \sum_i m_i (\mathbf{V} + \mathbf{v}'_i) \cdot (\mathbf{V} + \mathbf{v}'_i)
= \frac{1}{2} M V^2 + \frac{1}{2} \sum_i m_i v_i'^2. \tag{1.18}
\]
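
The two decompositions (1.17) and (1.18) are easy to verify numerically. The sketch below does so for three particles; the masses, positions and velocities are arbitrary sample values assumed for illustration, and the helper names are not from the text.

```python
def add(a, b):
    return tuple(x + y for x, y in zip(a, b))

def sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

def scale(s, a):
    return tuple(s * x for x in a)

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def vsum(vectors):
    total = (0.0, 0.0, 0.0)
    for v in vectors:
        total = add(total, v)
    return total

# Arbitrary sample data (assumed values, for illustration only)
masses = [1.0, 2.0, 3.5]
positions = [(1.0, 0.0, 2.0), (-1.0, 3.0, 0.5), (0.0, -2.0, 1.0)]
velocities = [(0.5, 1.0, 0.0), (0.0, -1.0, 2.0), (1.5, 0.0, -0.5)]

M = sum(masses)
R = scale(1.0 / M, vsum(scale(m, r) for m, r in zip(masses, positions)))   # CM position
V = scale(1.0 / M, vsum(scale(m, v) for m, v in zip(masses, velocities)))  # CM velocity

rel_pos = [sub(r, R) for r in positions]    # r'_i
rel_vel = [sub(v, V) for v in velocities]   # v'_i

# Angular momentum: total, CM part, and part relative to the CM
L_total = vsum(cross(r, scale(m, v)) for m, r, v in zip(masses, positions, velocities))
L_cm = cross(R, scale(M, V))
L_rel = vsum(cross(r, scale(m, v)) for m, r, v in zip(masses, rel_pos, rel_vel))

# Kinetic energy: total, CM part, and part relative to the CM
T_total = sum(0.5 * m * dot(v, v) for m, v in zip(masses, velocities))
T_cm = 0.5 * M * dot(V, V)
T_rel = sum(0.5 * m * dot(v, v) for m, v in zip(masses, rel_vel))

angular_gap = max(abs(a - b) for a, b in zip(L_total, add(L_cm, L_rel)))
kinetic_gap = abs(T_total - (T_cm + T_rel))
```

Both gaps should vanish up to floating-point rounding, whatever sample data is used.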
C. Constraints and generalized coordinates

Particles moving around in a system may be subject to constraints. Examples of this would include a gas in a container, where
the particles cannot have positions outside of the container, or a ball rolling down a hill, with the criterion that the ball must
always touch the ground. Various constraints can be classified in different ways:

Holonomic constraints: May be written as $f(\mathbf{r}_1, \mathbf{r}_2, \ldots, t) = 0$. An example is the equation for a rigid body (the distance
between two points is constant): $(\mathbf{r}_i - \mathbf{r}_j)^2 - c_{ij}^2 = 0$.

Non-holonomic constraints: May not be expressed in the form $f(\mathbf{r}_1, \mathbf{r}_2, \ldots, t) = 0$. An example is a person running on a hill:
his or her position may be on the ground or above the ground, but not inside the hill. If the radius of the hill is a, we thus have
the constraint $r^2 - a^2 \geq 0$.

One also speaks of rheonomous constraints which are time-dependent, and scleronomous constraints which are time- 
independent. Having introduced constraints leads us to the important concept of generalized coordinates. Imagine that we 
have a system with N particles that can move in all three dimensions. We would then say that we have 3N degrees of freedom: 
each particle can move in three different directions, with each one corresponding to a degree of freedom in the particle’s motion. 
If there are k holonomic restrictions present in the system, the number of degrees of freedom will be reduced. Where there were 
originally 3N degrees of freedom, there are now 3N — k. 

To put this mathematically, let $\mathbf{r}_i$ denote the position vector of particle i, with i = 1, ..., N. However, not all of these position vectors
can be independent since there are constraints in the system: for instance, the distance between two particles is fixed for a rigid
body. Thus, there are instead 3N - k independent coordinates which we name $q_1, q_2, \ldots, q_{3N-k}$. These generalized coordinates
thus take into account the constraints of the system and can be used to describe the position vectors $\mathbf{r}_i$. We have:

\begin{align}
\mathbf{r}_1 &= \mathbf{r}_1(q_1, q_2, \ldots, q_{3N-k}, t), \notag \\
\mathbf{r}_2 &= \mathbf{r}_2(q_1, q_2, \ldots, q_{3N-k}, t), \tag{1.19} \\
&\;\;\vdots \notag \\
\mathbf{r}_N &= \mathbf{r}_N(q_1, q_2, \ldots, q_{3N-k}, t). \tag{1.20}
\end{align}


Example 1. Double-pendulum moving in a plane. With two particles, we would expect to have 6 degrees of freedom.
However, restricting the motion of the pendulum to take place in a plane, i.e. moving in 2D rather than 3D, we are removing
one degree of freedom per particle. We are then left with 4 degrees of freedom. When we additionally demand that the distances
are constant (the first particle at a fixed distance from the pivot and the second at a fixed distance from the first, for instance
by connecting them with rigid rods), we are removing one more degree of freedom per particle. We thus end up with a total of
6 - 4 = 2 degrees of freedom, which are described by two generalized coordinates. For our particular system under consideration,
the generalized coordinates are the angles $\theta_1$ and $\theta_2$ in the figure.
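
To make the role of the generalized coordinates concrete, here is a small sketch that maps two angles to the Cartesian positions of both masses and confirms that the rod constraints hold automatically. Since the figure is not reproduced here, the conventions are assumptions: the pivot sits at the origin, y points upward, both angles are measured from the downward vertical, and the rod lengths l1 and l2 are hypothetical values.

```python
import math

def double_pendulum_positions(theta1, theta2, l1=1.0, l2=0.7):
    """Cartesian positions of both masses for given generalized coordinates.

    Assumed conventions: pivot at the origin, y upward, angles measured
    from the downward vertical; l1 and l2 are illustrative rod lengths.
    """
    x1 = l1 * math.sin(theta1)
    y1 = -l1 * math.cos(theta1)
    x2 = x1 + l2 * math.sin(theta2)
    y2 = y1 - l2 * math.cos(theta2)
    return (x1, y1), (x2, y2)

def constraint_violations(theta1, theta2, l1=1.0, l2=0.7):
    """Deviation of each rod length from its prescribed value."""
    (x1, y1), (x2, y2) = double_pendulum_positions(theta1, theta2, l1, l2)
    return (math.hypot(x1, y1) - l1, math.hypot(x2 - x1, y2 - y1) - l2)

sample_angles = (-2.0, -0.3, 0.0, 0.9, 2.5)
worst_violation = max(
    abs(c)
    for t1 in sample_angles
    for t2 in sample_angles
    for c in constraint_violations(t1, t2)
)
```

Four Cartesian numbers are produced from only two inputs, and the two holonomic constraints are satisfied for every choice of the angles: this is exactly what it means for the generalized coordinates to "take into account" the constraints.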



D. D’Alembert’s principle and Lagrange’s equations 

As a preliminary to this section, we first define the concept of a virtual displacement: it is an infinitesimal displacement $\delta\mathbf{r}_i$ of
the coordinates of the system which respects any constraints that are present. Assume first that we’re dealing with a system in
equilibrium. This means that the net force acting on each particle is equal to zero, $\mathbf{F}_i = 0$. As a result, we also have $\sum_i \mathbf{F}_i \cdot \delta\mathbf{r}_i = 0$.
Now, the force acting on particle i can be split up into an externally applied force $\mathbf{F}_i^a$ and a force $\mathbf{f}_i$ resulting from constraints
in the system:

\[
\mathbf{F}_i = \mathbf{F}_i^a + \mathbf{f}_i. \tag{1.21}
\]

An example of a constraint force would be the force exerted by the wall on a particle inside a gas container. It is reasonable to
assume that $\mathbf{f}_i \cdot \delta\mathbf{r}_i = 0$: any force constraining the motion of a particle i should act perpendicularly to any allowed displacement
$\delta\mathbf{r}_i$. Another way to put this is to say that forces from constraints do no work. We can see this e.g. for the normal force from
the floor acting on objects on it. When this force acts, the object is only allowed to be displaced perpendicularly to it (along the
floor). It is important to note that we are excluding friction in this way, which would not satisfy $\mathbf{f}_i \cdot \delta\mathbf{r}_i = 0$. However, we shall
return to see how friction may be incorporated later on. We are now left with

\[
\sum_i \mathbf{F}_i^a \cdot \delta\mathbf{r}_i = 0. \tag{1.22}
\]


Note that this equation does not imply that each $\mathbf{F}_i^a$ is zero, since the $\delta\mathbf{r}_i$ vectors are not independent in general because of
the presence of constraints. Only if we use generalized coordinates, which as we have seen take into account the presence of
constraints, can we say that the coordinates are independent of each other.

Let us now turn to the more interesting case of a system in motion, that is, out of equilibrium. The equation of motion for
particle i is then given by Newton’s 2nd law, $\mathbf{F}_i - \dot{\mathbf{p}}_i = 0$. Analogously with the static case above, we may decompose the force
into an applied part and a part due to constraints. Performing a summation over i and taking the scalar product with the virtual
displacement, we obtain:

\[
\sum_i (\mathbf{F}_i^a - \dot{\mathbf{p}}_i) \cdot \delta\mathbf{r}_i = 0. \tag{1.23}
\]
This is called D’Alembert’s principle and the merit of this equation is that we have now eliminated the forces stemming from
constraints in the system. By assuming (without much loss of generality) that the constraints are holonomic, we may introduce
generalized coordinates as described earlier:

\[
\mathbf{r}_i = \mathbf{r}_i(q_1, q_2, \ldots, q_n, t). \tag{1.24}
\]

It follows that the velocity of particle i is given by (via the chain rule):

\[
\mathbf{v}_i = \frac{d\mathbf{r}_i}{dt} = \sum_j \frac{\partial \mathbf{r}_i}{\partial q_j} \dot{q}_j + \frac{\partial \mathbf{r}_i}{\partial t} = \mathbf{v}_i(q_j, \dot{q}_j, t). \tag{1.25}
\]

Similarly, the virtual displacement $\delta\mathbf{r}_i$ of the position vector itself may be written in terms of the virtual displacement of the
generalized coordinates:

\[
\delta\mathbf{r}_i = \sum_j \frac{\partial \mathbf{r}_i}{\partial q_j} \delta q_j. \tag{1.26}
\]

Note that virtual displacements involve only coordinate-displacements and not time, so that St does not enter: the displacement 
takes place at a fixed time. Let us now examine D’Alembert’s principle in more detail to see what comes out of it. The first term 
is: 




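\[
\sum_i \mathbf{F}_i^a \cdot \delta\mathbf{r}_i = \sum_{i,j} \mathbf{F}_i^a \cdot \frac{\partial \mathbf{r}_i}{\partial q_j} \delta q_j = \sum_j Q_j \delta q_j. \tag{1.27}
\]

We defined the generalized force $Q_j = \sum_i \mathbf{F}_i^a \cdot \partial \mathbf{r}_i / \partial q_j$. Note that the dimension of $Q_j \delta q_j$ is work. The dimensions of $Q_j$ and $\delta q_j$
themselves will however depend on the geometry. For instance, if $q_j$ denotes an angle with dimension rad, the dimension of $Q_j$
will be J/rad.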
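
As a brief worked illustration of the generalized force (the plane pendulum used here is an assumed example, with the angle θ measured from the downward vertical and the y axis pointing upward), take a point mass m on a rigid rod of length l under gravity. With the single generalized coordinate θ:

```latex
\mathbf{r} = (l \sin\theta,\; -l \cos\theta), \qquad
\mathbf{F}^a = (0,\; -mg), \qquad
\frac{\partial \mathbf{r}}{\partial \theta} = (l \cos\theta,\; l \sin\theta),

Q_\theta = \mathbf{F}^a \cdot \frac{\partial \mathbf{r}}{\partial \theta} = -m g l \sin\theta .
```

Here the generalized force is the torque of gravity about the pivot, with dimension J/rad, in line with the remark above.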


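We now look at the second term of D’Alembert’s principle:

\[
\sum_i \dot{\mathbf{p}}_i \cdot \delta\mathbf{r}_i = \sum_{i,j} m_i \ddot{\mathbf{r}}_i \cdot \frac{\partial \mathbf{r}_i}{\partial q_j} \delta q_j. \tag{1.28}
\]

We can rewrite as follows:

\[
\sum_i m_i \ddot{\mathbf{r}}_i \cdot \frac{\partial \mathbf{r}_i}{\partial q_j}
= \sum_i \left[ \frac{d}{dt}\left( m_i \dot{\mathbf{r}}_i \cdot \frac{\partial \mathbf{r}_i}{\partial q_j} \right) - m_i \dot{\mathbf{r}}_i \cdot \frac{d}{dt} \frac{\partial \mathbf{r}_i}{\partial q_j} \right]. \tag{1.29}
\]

In the last term in the above equation, we may exchange the order of $d/dt$ and $\partial/\partial q_j$:

\[
\frac{d}{dt} \frac{\partial \mathbf{r}_i}{\partial q_j} = \frac{\partial}{\partial q_j} \frac{d\mathbf{r}_i}{dt}. \tag{1.30}
\]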


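To see why this is allowed, we look at how these operators act on $\mathbf{r}_i$ in detail. If we first differentiate with respect to $q_j$ and then
with respect to time, we get:

\[
\frac{d}{dt} \frac{\partial \mathbf{r}_i}{\partial q_j} = \sum_k \frac{\partial^2 \mathbf{r}_i}{\partial q_j \partial q_k} \dot{q}_k + \frac{\partial^2 \mathbf{r}_i}{\partial q_j \partial t}. \tag{1.31}
\]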


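Now reverse the order: differentiate with respect to time first and then with respect to $q_j$ to obtain

\[
\frac{\partial}{\partial q_j} \frac{d\mathbf{r}_i}{dt} = \frac{\partial}{\partial q_j} \left( \sum_k \frac{\partial \mathbf{r}_i}{\partial q_k} \dot{q}_k + \frac{\partial \mathbf{r}_i}{\partial t} \right). \tag{1.32}
\]

The result is seen to be identical. In the same way, one can also show that

\[
\frac{\partial \mathbf{v}_i}{\partial \dot{q}_j} = \frac{\partial \mathbf{r}_i}{\partial q_j}. \tag{1.33}
\]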
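Using the above relations in equation (1.29), we get:

\[
\sum_i m_i \ddot{\mathbf{r}}_i \cdot \frac{\partial \mathbf{r}_i}{\partial q_j}
= \sum_i \left[ \frac{d}{dt}\left( m_i \mathbf{v}_i \cdot \frac{\partial \mathbf{v}_i}{\partial \dot{q}_j} \right) - m_i \mathbf{v}_i \cdot \frac{\partial \mathbf{v}_i}{\partial q_j} \right]
= \frac{d}{dt} \frac{\partial T}{\partial \dot{q}_j} - \frac{\partial T}{\partial q_j}. \tag{1.34}
\]

We may then rewrite D’Alembert’s principle in the following way:

\[
\sum_j \left[ \frac{d}{dt} \frac{\partial T}{\partial \dot{q}_j} - \frac{\partial T}{\partial q_j} - Q_j \right] \delta q_j = 0. \tag{1.35}
\]

Actually, we can make an even stronger statement: each term inside the summation has to be zero individually. The key to making
this statement is to realize that with holonomic constraints, which we assumed, all generalized coordinates are independent of
each other. Thus, the $\delta q_j$ are independent quantities and the following n second order equations must be satisfied:

\[
\frac{d}{dt} \frac{\partial T}{\partial \dot{q}_j} - \frac{\partial T}{\partial q_j} = Q_j. \tag{1.36}
\]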


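Let us make one more assumption, which still includes a vast number of physical situations, and set our system to be conservative:
$\mathbf{F}_i = -\nabla_i V$, with $V = V(q_j)$. We then get for the generalized force, using the chain rule,

\[
Q_j = \sum_i \mathbf{F}_i \cdot \frac{\partial \mathbf{r}_i}{\partial q_j} = -\sum_i \nabla_i V \cdot \frac{\partial \mathbf{r}_i}{\partial q_j} = -\frac{\partial V}{\partial q_j}. \tag{1.37}
\]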
The potential V depends on the coordinates, but not on the generalized velocities $\dot{q}_j$. (We will later see how to deal with
velocity-dependent potentials.) Since V does not depend on $\dot{q}_j$, we may add a term to equation (1.36) which is equal to zero,
namely $\partial V / \partial \dot{q}_j$. We have thus ended up with

\[
\frac{d}{dt} \frac{\partial L}{\partial \dot{q}_j} - \frac{\partial L}{\partial q_j} = 0,
\]

where we defined the Lagrange function L = T - V. These are the Lagrange equations: arguably the most important equations
in this entire book. They were derived under the assumption of a holonomic and conservative system. There are in general n
Lagrange equations, each of second order. We thus need 2n initial conditions in all.

An important feature to notice is that L is in fact not uniquely defined. The physics is unchanged if we instead of L use L' where
$L' = L + dF(q, t)/dt$, as can be verified by direct insertion. The Lagrange equations remain the same; we will see an example
of this in the last subsection of this chapter.

Note that in deriving the Lagrange equations, we have ended up with simpler equations that involve scalar functions, the kinetic 
energy T and potential energy V, rather than working with vectors such as the force $\mathbf{F}_i$ and acceleration $\mathbf{a}_i$ for each particle.
Moreover, we have automatically included the role played by constraints in the system since it is the generalized coordinates qi 
that enter the Lagrange equations. 
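
The statement that a physical trajectory satisfies the Lagrange equations can be tested numerically. The sketch below is only an illustration under stated assumptions: it uses a one-dimensional harmonic oscillator (a choice made here, not taken from the text), approximates all derivatives by central finite differences, and the function name `euler_lagrange_residual` is hypothetical.

```python
import math

def euler_lagrange_residual(lagrangian, q, qdot, t, h=1e-4, eps=1e-5):
    """Finite-difference estimate of d/dt(dL/dqdot) - dL/dq along a trial path.

    `lagrangian(x, v, t)` returns a float; `q(t)` and `qdot(t)` are callables
    giving the trial trajectory and its velocity. A residual close to zero
    means the path satisfies the Lagrange equation at time t.
    """
    def dL_dqdot(time):
        x, v = q(time), qdot(time)
        return (lagrangian(x, v + eps, time) - lagrangian(x, v - eps, time)) / (2 * eps)

    def dL_dq(time):
        x, v = q(time), qdot(time)
        return (lagrangian(x + eps, v, time) - lagrangian(x - eps, v, time)) / (2 * eps)

    d_dt = (dL_dqdot(t + h) - dL_dqdot(t - h)) / (2 * h)
    return d_dt - dL_dq(t)

# Assumed example system: L = m v^2 / 2 - k x^2 / 2
m, k = 1.5, 6.0
omega = math.sqrt(k / m)

def oscillator_lagrangian(x, v, t):
    return 0.5 * m * v * v - 0.5 * k * x * x

def q_true(t):       # exact solution x = cos(omega t)
    return math.cos(omega * t)

def qdot_true(t):
    return -omega * math.sin(omega * t)

def q_wrong(t):      # same shape but wrong frequency: not a solution
    return math.cos(1.3 * omega * t)

def qdot_wrong(t):
    return -1.3 * omega * math.sin(1.3 * omega * t)

res_true = euler_lagrange_residual(oscillator_lagrangian, q_true, qdot_true, 0.7)
res_wrong = euler_lagrange_residual(oscillator_lagrangian, q_wrong, qdot_wrong, 0.7)
```

The residual is zero (up to discretization and rounding error) only for the true solution; the path with the wrong frequency leaves a residual of order one.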


E. Levi-Civita symbol 

Before proceeding to discuss extensions and modifications of the Lagrange equations, it is useful here to introduce the
mathematical quantity known as the Levi-Civita tensor $\epsilon_{ijk}$. To see how this works, we also introduce some convenient notation for
working with vectors. Using Cartesian coordinates, we write

\[
\mathbf{A} \cdot \mathbf{B} = \sum_i A_i B_i = A_i B_i, \tag{1.38}
\]

which is known as a sum convention: repeated indices imply summation over them. We may then also write:

\[
\nabla \cdot \mathbf{A} = \frac{\partial A_i}{\partial x_i} = \partial_i A_i \tag{1.39}
\]

and

\[
\nabla \phi = \mathbf{e}_i \partial_i \phi \tag{1.40}
\]

where $\mathbf{e}_i$ is the unit vector in the i direction. As for the Levi-Civita symbol, it is antisymmetric in all indices and changes sign when
two indices exchange position. Moreover, it is equal to zero when at least two indices are the same. Thus, we have that

\[
\epsilon_{ijk} =
\begin{cases}
+1 & \text{when } i, j, k \text{ are a cyclic permutation of } 1, 2, 3 \quad (\epsilon_{123} = +1), \\
-1 & \text{when } i, j, k \text{ are an anticyclic permutation of } 1, 2, 3 \quad (\epsilon_{132} = -1).
\end{cases} \tag{1.41}
\]

This notation is particularly useful when dealing with cross-products. If $\mathbf{A} = \mathbf{B} \times \mathbf{C}$, then $A_i = \epsilon_{ijk} B_j C_k$. Since j and k are
repeated indices, a summation over both is implied. It is also handy to note the relation

\[
\epsilon_{ijk} \epsilon_{ilm} = \delta_{jl} \delta_{km} - \delta_{jm} \delta_{kl}. \tag{1.42}
\]


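Example 2. Using the Levi-Civita tensor.

\begin{align}
(\mathbf{A} \times \mathbf{B}) \cdot (\mathbf{C} \times \mathbf{D}) &= (\mathbf{A} \times \mathbf{B})_i (\mathbf{C} \times \mathbf{D})_i = \epsilon_{ijk} A_j B_k \epsilon_{ilm} C_l D_m \notag \\
&= (\delta_{jl} \delta_{km} - \delta_{jm} \delta_{kl}) A_j B_k C_l D_m = (\mathbf{A} \cdot \mathbf{C})(\mathbf{B} \cdot \mathbf{D}) - (\mathbf{A} \cdot \mathbf{D})(\mathbf{B} \cdot \mathbf{C}). \tag{1.43}
\end{align}

Note that quantities such as $\epsilon_{ijk}$, $A_j$, $B_k$ are scalars, which means that they can be moved around as we please (scalars commute).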
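
The index manipulations above can be checked by brute force. The following sketch implements the Levi-Civita symbol and verifies relation (1.42) for all index combinations, as well as identity (1.43) for one set of sample vectors. Note the assumptions: indices run over 0, 1, 2 in the code (instead of 1, 2, 3 in the text), the closed-form expression used for the symbol is one convenient choice among several, and the sample vectors are arbitrary.

```python
from itertools import product

def levi_civita(i, j, k):
    """Levi-Civita symbol for indices in {0, 1, 2}: +1, -1 or 0."""
    return (i - j) * (j - k) * (k - i) // 2

def delta(i, j):
    """Kronecker delta."""
    return 1 if i == j else 0

def cross(b, c):
    """Cross product written as A_i = eps_ijk B_j C_k (sum over j and k)."""
    return [
        sum(levi_civita(i, j, k) * b[j] * c[k] for j in range(3) for k in range(3))
        for i in range(3)
    ]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

# Relation (1.42): eps_ijk eps_ilm = delta_jl delta_km - delta_jm delta_kl
identity_holds = all(
    sum(levi_civita(i, j, k) * levi_civita(i, l, m) for i in range(3))
    == delta(j, l) * delta(k, m) - delta(j, m) * delta(k, l)
    for j, k, l, m in product(range(3), repeat=4)
)

# Identity (1.43) for arbitrary integer sample vectors (assumed values)
A, B, C, D = [1, 2, 3], [-1, 0, 4], [2, 5, -2], [0, 3, 1]
lhs = dot(cross(A, B), cross(C, D))
rhs = dot(A, C) * dot(B, D) - dot(A, D) * dot(B, C)
```

Integer components are used so that both sides can be compared exactly, without floating-point tolerance.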
F. Friction and other velocity-dependent potentials


Armed with the Levi-Civita tensor, we now pose the question: what happens with Lagrange’s equations if the potential depends
on velocity? The answer is that Lagrange’s equations keep their form if L = T - U and $Q_j = -\frac{\partial U}{\partial q_j} + \frac{d}{dt} \frac{\partial U}{\partial \dot{q}_j}$, where $U = U(q_j, \dot{q}_j)$
is the velocity-dependent potential. Here $q_j$ and $\dot{q}_j$ are regarded as independent variables.

An important example of this is the electromagnetic potential. From electrodynamics, we know that the Lorentz-force is the
force acting on a particle with charge q moving in an electric field $\mathbf{E}$ and magnetic field $\mathbf{B}$ with velocity $\mathbf{v}$:

\[
\mathbf{F} = q(\mathbf{E} + \mathbf{v} \times \mathbf{B}). \tag{1.44}
\]


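The electric and magnetic fields themselves can be written in terms of a vector potential $\mathbf{A}$ and a scalar potential $\phi$. This can be
seen by considering two of Maxwell’s equations:

\[
\nabla \cdot \mathbf{B} = 0, \qquad \nabla \times \mathbf{E} = -\frac{\partial \mathbf{B}}{\partial t}. \tag{1.45}
\]

The first of these equations shows that we can always write $\mathbf{B} = \nabla \times \mathbf{A}$ since the divergence of a curl is zero. Inserting this into
the second equation, we obtain

\[
\nabla \times \left( \mathbf{E} + \frac{\partial \mathbf{A}}{\partial t} \right) = 0. \tag{1.46}
\]

Since the curl of a gradient is also zero, it follows that we can write $\mathbf{E} + \partial \mathbf{A} / \partial t = -\nabla \phi$. We can now express the Lorentz-force
in terms of $\mathbf{A}$ and $\phi$ instead of $\mathbf{E}$ and $\mathbf{B}$:

\[
\mathbf{F} = q\left( -\nabla \phi - \frac{\partial \mathbf{A}}{\partial t} + \mathbf{v} \times (\nabla \times \mathbf{A}) \right). \tag{1.47}
\]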

You may wonder why we would like to replace the electric and magnetic fields with the vector potential and the scalar potential.
What is the benefit of this? This is related to a concept known as gauge-invariance which we shall return to later on in the 
chapter on the special theory of relativity. 

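To continue working with the Lorentz-force, it is convenient at this point to make use of the Levi-Civita symbol. By using that

\[
[\mathbf{v} \times (\nabla \times \mathbf{A})]_i = v_j \partial_i A_j - v_j \partial_j A_i, \tag{1.48}
\]

we get:

\[
F_i = q(-\partial_i \phi - \partial_t A_i + v_j \partial_i A_j - v_j \partial_j A_i). \tag{1.49}
\]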

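Note that $\partial_i$ and $A_j$ do not commute, since $\partial_i$ is an operator acting on whatever comes after it. On the other hand, $\partial_i$ and $v_j$
do commute since the velocity $v_j$ has no explicit dependence on position (it is, as we know, defined as the derivative of position
with respect to time). It follows that the relations below hold:

\[
v_j \partial_i A_j = \partial_i (\mathbf{v} \cdot \mathbf{A}), \qquad
\frac{dA_i}{dt} = \frac{\partial A_i}{\partial t} + v_j \partial_j A_i, \tag{1.50}
\]

and thus we can rewrite equation (1.49) as

\[
F_i = q\left[ -\partial_i \phi + \partial_i (\mathbf{v} \cdot \mathbf{A}) - \frac{dA_i}{dt} \right]
= -\frac{\partial U}{\partial x_i} + \frac{d}{dt} \frac{\partial U}{\partial v_i}, \tag{1.51}
\]

where we defined $U = q\phi - q\mathbf{A} \cdot \mathbf{v} = U(\mathbf{r}, \mathbf{v})$. Note that in the second term we have made use of the fact that $\phi$ is a function
of $\mathbf{r}$ and t only; thus $\partial \phi / \partial v_i = 0$.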
We have managed to write the force in a form which is consistent with the Lagrange equations in the form we derived in the
previous section, and we thus identify U as the generalized velocity-dependent potential. The Lagrange function is then:

\[
L = T - U = T - q\phi + q\mathbf{A} \cdot \mathbf{v}. \tag{1.52}
\]

This is called minimal coupling in field theory. We emphasize that U is not the potential energy of the particle, unlike the case
we considered previously where the potential energy only depended on position. In fact, it is instructive to consider in some
more detail whether or not the electromagnetic force is conservative.

We know from its definition that a conservative force can be written as the gradient of a scalar potential, and ensures that 
energy T + V is conserved. If we only have a pure electric field, we see that the Lorentz-force can indeed be written as 
the gradient of a scalar potential since $\mathbf{E} = -\nabla \phi$. A purely electric force is thus conservative. But what if we also have a
magnetic field? In this case, it is clear that the Lorentz-force cannot be written as the gradient of a scalar potential. Hence, 
magnetic forces are formally classified as non-conservative. What is the implication with respect to energy conservation? 
Well, we know that magnetic forces do no work since the force acts perpendicularly to the velocity (due to the cross-product 
between velocity and magnetic field). So while the magnetic force is formally classified as non-conservative, as it cannot be 
written as the gradient of a scalar potential, that does not necessarily mean that energy is not conserved. For instance, energy 
is stored in the electromagnetic field itself which thus in principle can be converted into mechanical energy for a charged particle. 
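
The claim that the magnetic part of the Lorentz force does no work can be illustrated with a few lines of code. This is a minimal sketch: the charge, velocity and field values are arbitrary numbers assumed for the example (in no particular units), and the function name `lorentz_force` is hypothetical.

```python
def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def lorentz_force(q, E, v, B):
    """F = q (E + v x B), the form given in equation (1.44)."""
    v_cross_b = cross(v, B)
    return tuple(q * (E[i] + v_cross_b[i]) for i in range(3))

# Arbitrary sample values (assumed, for illustration only)
charge = 2.0
velocity = (1.0, -2.0, 0.5)
B_field = (0.3, 0.1, -0.7)
E_field = (0.0, 1.0, 0.0)

# Power delivered to the particle, P = F . v
power_magnetic_only = dot(lorentz_force(charge, (0.0, 0.0, 0.0), velocity, B_field), velocity)
power_with_E = dot(lorentz_force(charge, E_field, velocity, B_field), velocity)
```

With only a magnetic field the power F · v vanishes identically, because v × B is perpendicular to v. Once an electric field is present, the power reduces to q E · v, so only the electric part changes the kinetic energy.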

In general, to calculate the magnetic field energy built up when a magnetic field is being applied, we must examine the electric 
fields induced by the change in the magnetic field and determine the work done by these fields on the currents producing the 
magnetic field. The electric field here is in a sense a second, though indispensable, ingredient. The total energy, however, has 
to be conserved. We will later derive the fundamental result that the exact criterion for energy to be conserved is that there is
no explicit time-dependence in the Lagrange function L. So as long as the functions $\phi$ and $\mathbf{A}$ are time-independent, energy is
conserved even if the magnetic force is said to be non-conservative. 
Another important non-conservative force is friction. For a holonomic system, Lagrange’s equations may always be written in
the form:

\[
\frac{d}{dt} \frac{\partial L}{\partial \dot{q}_i} - \frac{\partial L}{\partial q_i} = Q_i, \tag{1.53}
\]


where L contains the potential from conservative forces while $Q_i$ contains the forces that cannot be derived from a potential.
Frictional forces are an example of this. The origin of friction is actually electromagnetic in nature, but frictional forces may
often be well accounted for by a phenomenological form: namely, by setting the frictional force $\mathbf{F}_f$ to be proportional to the
velocity $\mathbf{v}$ of the particle. For one particle moving along the x axis, we thus have

\[
F_{f,x} = -k_x v_x = -\frac{\partial}{\partial v_x} \left( \frac{1}{2} k_x v_x^2 \right). \tag{1.54}
\]

The work performed by the system to overcome the frictional force can be computed as follows:

\[
dW_f = -\mathbf{F}_f \cdot d\mathbf{r} = -\mathbf{F}_f \cdot \mathbf{v}\, dt = (k_x v_x^2 + k_y v_y^2 + k_z v_z^2)\, dt. \tag{1.55}
\]


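For several particles moving in three dimensions, $\mathbf{F}_{f,i} = -\nabla_{\mathbf{v}_i} \mathcal{F}$. Here, we have introduced Rayleigh’s dissipation function
$\mathcal{F} = \frac{1}{2} \sum_i (k_x v_{ix}^2 + k_y v_{iy}^2 + k_z v_{iz}^2)$ where the subscript i denotes particle i. We see that we can then express the rate of energy
loss due to friction via the dissipation function: it is simply $dW_f/dt = 2\mathcal{F}$. The generalized force stemming from friction may
be computed by going back to the definition of generalized forces:

\[
Q_j = \sum_i \mathbf{F}_{f,i} \cdot \frac{\partial \mathbf{r}_i}{\partial q_j}
= -\sum_i \nabla_{\mathbf{v}_i} \mathcal{F} \cdot \frac{\partial \mathbf{r}_i}{\partial q_j}
= -\sum_i \nabla_{\mathbf{v}_i} \mathcal{F} \cdot \frac{\partial \dot{\mathbf{r}}_i}{\partial \dot{q}_j}
= -\frac{\partial \mathcal{F}}{\partial \dot{q}_j}. \tag{1.56}
\]

Lagrange’s equations now read:

\[
\frac{d}{dt} \frac{\partial L}{\partial \dot{q}_i} - \frac{\partial L}{\partial q_i} + \frac{\partial \mathcal{F}}{\partial \dot{q}_i} = 0. \tag{1.57}
\]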


In other words, we need to know two scalar functions, L and the dissipation function $\mathcal{F}$, in order to obtain the equations of motion for a system with
friction. 
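
As a brief worked illustration of equation (1.57), consider a damped harmonic oscillator. This example is an assumption introduced here for illustration: one particle of mass m on the x axis, attached to a spring with a (hypothetical) spring constant κ and subject to linear friction with coefficient k_x.

```latex
L = \tfrac{1}{2} m \dot{x}^2 - \tfrac{1}{2} \kappa x^2, \qquad
\mathcal{F} = \tfrac{1}{2} k_x \dot{x}^2,

\frac{d}{dt} \frac{\partial L}{\partial \dot{x}} - \frac{\partial L}{\partial x} + \frac{\partial \mathcal{F}}{\partial \dot{x}}
= m \ddot{x} + \kappa x + k_x \dot{x} = 0,

\frac{d}{dt} \left( \tfrac{1}{2} m \dot{x}^2 + \tfrac{1}{2} \kappa x^2 \right)
= \dot{x} \left( m \ddot{x} + \kappa x \right) = -k_x \dot{x}^2 = -2 \mathcal{F}.
```

The first line gives the two scalar functions, the second the familiar damped-oscillator equation of motion, and the third shows that the mechanical energy decreases at the rate 2𝓕, consistent with the expression for the energy loss given above.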


The case where $\mathbf{F}_f$ is proportional to $\mathbf{v}$ is actually of great importance in physics, especially in microfluidics and in medical
technology. It ought to be emphasized, however, that the equation is applicable only under the condition that the velocity v
is "small" when looked upon in conjunction with the viscosity of the fluid through which the body is moving. Explicitly, the
condition is that the so-called Reynolds number Re must be much less than one. The definition of the Reynolds number is
$\mathrm{Re} = \rho v l / \mu$, where $\rho$ is the density of the surrounding fluid, l a typical length, and $\mu$ the dynamic viscosity. For a sphere of
radius R, for instance, one can simply put l = 2R. As an example, consider a human cell for which approximately R = 5 μm,
traveling at a speed not greater than 10 μm/s. Then, for a water-like fluid, Re is of order $10^{-4}$, showing that the above condition is amply satisfied and
that viscous forces are dominant.
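
The order-of-magnitude estimate for the cell can be reproduced directly from the definition of the Reynolds number. In this sketch the fluid properties are assumptions not stated in the text: the density and dynamic viscosity are taken to be roughly those of water, and the cell is given a radius of 5 micrometres and a speed of 10 micrometres per second.

```python
def reynolds_number(density, speed, length, viscosity):
    """Re = rho * v * l / mu, with all quantities in SI units."""
    return density * speed * length / viscosity

# Assumed fluid properties (approximately water at room temperature)
RHO_WATER = 1000.0   # kg / m^3
MU_WATER = 1.0e-3    # Pa s

cell_radius = 5.0e-6   # m  (5 micrometres)
cell_speed = 10.0e-6   # m / s  (10 micrometres per second)

# For a sphere, the typical length is taken as the diameter, l = 2R
re_cell = reynolds_number(RHO_WATER, cell_speed, 2.0 * cell_radius, MU_WATER)
```

With these inputs the Reynolds number comes out at about 1e-4, far below one, so the linear friction law is well justified in this regime.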


Another case is a DNA molecule, which can be stretched into a linear strand by hydrodynamical means. Imagine that one end 
of the DNA is attached to a glass plate and that a spherical bead is fixed to the other end, giving hydrodynamic drag to a viscous 
liquid flowing past, parallel to the plate. In this way the DNA becomes stretched. Increasing the fluid velocity until the strand 
snaps, one can actually determine the elastic strength of the strand. 
G. Examples

In this subsection, we want to look at the Lagrange equations "in action" and so we consider several examples that illustrate their
usage.


Example 3. One particle with Cartesian coordinates. Before proceeding to derive the Lagrange equations for this system,
which effectively constitute the equations of motion describing the particle, take a moment to think about what you would expect
the equations to look like. That’s right: we should expect to recover Newton’s 2nd law for a particle moving under the influence
of a force. For our system, we have $T = \frac{1}{2} m (\dot{x}^2 + \dot{y}^2 + \dot{z}^2)$. It follows that:

\[
\frac{\partial T}{\partial x} = \frac{\partial T}{\partial y} = \frac{\partial T}{\partial z} = 0, \qquad
\frac{\partial T}{\partial \dot{x}} = m\dot{x}, \quad
\frac{\partial T}{\partial \dot{y}} = m\dot{y}, \quad
\frac{\partial T}{\partial \dot{z}} = m\dot{z}. \tag{1.58}
\]

The equation of motion is:

\[
\frac{d}{dt} \frac{\partial T}{\partial \dot{q}_i} - \frac{\partial T}{\partial q_i} = Q_i, \tag{1.59}
\]

where $q_1 = x$, $q_2 = y$, $q_3 = z$ and $Q_i = F_i$. The equations of motion become

\[
m\ddot{x} = F_x, \tag{1.60}
\]

and identically for y and z, meaning that we, as expected, recover Newton’s 2nd law.



Example 4. Atwood’s machine. There is only one independent (generalized) coordinate, namely x. To construct the Lagrange
equations, we first need to identify the Lagrange function L. To do so, we need the potential energy V and kinetic energy T:

\begin{align}
V &= -M_1 g x - M_2 g (l - x), \notag \\
T &= \tfrac{1}{2} (M_1 + M_2) \dot{x}^2, \notag \\
L &= T - V = \tfrac{1}{2} (M_1 + M_2) \dot{x}^2 + M_1 g x + M_2 g (l - x). \tag{1.61}
\end{align}

The Lagrange equation for x then becomes:

\[
(M_1 + M_2) \ddot{x} = (M_1 - M_2) g. \tag{1.62}
\]
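
As a cross-check of equation (1.62), the acceleration can also be extracted numerically from the Lagrange function (1.61). The sketch below rests on stated assumptions: the masses, string length and state (x, ẋ) are arbitrary sample values, g = 9.81 m/s² is assumed, and the function names are hypothetical. It uses the fact that this particular L has no explicit time dependence and no mixed x-ẋ term, so the Lagrange equation reduces to (∂²L/∂ẋ²) ẍ = ∂L/∂x.

```python
def atwood_acceleration(m1, m2, g=9.81):
    """Closed-form acceleration from (M1 + M2) x'' = (M1 - M2) g."""
    return (m1 - m2) * g / (m1 + m2)

def atwood_lagrangian(x, xdot, m1, m2, length, g=9.81):
    """L = T - V as in equation (1.61)."""
    return 0.5 * (m1 + m2) * xdot ** 2 + m1 * g * x + m2 * g * (length - x)

def acceleration_from_lagrangian(x, xdot, m1, m2, length, g=9.81, eps=1e-3):
    """Solve (d^2 L / d xdot^2) * xddot = dL/dx using central differences.

    Valid here because L has no explicit time dependence and no term
    mixing x and xdot.
    """
    def L(a, b):
        return atwood_lagrangian(a, b, m1, m2, length, g)

    dL_dx = (L(x + eps, xdot) - L(x - eps, xdot)) / (2 * eps)
    d2L_dxdot2 = (L(x, xdot + eps) - 2 * L(x, xdot) + L(x, xdot - eps)) / eps ** 2
    return dL_dx / d2L_dxdot2

# Arbitrary sample values (assumed, for illustration only)
a_formula = atwood_acceleration(3.0, 1.0)
a_numeric = acceleration_from_lagrangian(0.5, 0.2, 3.0, 1.0, length=2.0)
```

Two limiting cases offer a quick sanity check of the formula: equal masses give zero acceleration, and a vanishing second mass gives free fall with acceleration g.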




[Figure: Atwood’s machine, with the two lengths labelled x and l - x.]









Introduction to Lagrangian & 
Hamiltonian Mechanics 


Fundamental principles 


Example 5. Pendulum driven at the pivot. This example is a slightly modified version of the one found in "Structure and
Interpretation of Classical Mechanics", section 1.6.2, by Gerald Sussman and Jack Wisdom. We previously stated that the Lagrange
function is not uniquely defined: we could add a total derivative of a function, $dF(q, t)/dt$, to L without changing the equations
of motion. To see how this fact can be used to one’s advantage, consider the system shown in the figure: a pendulum driven by
vertical motion of the pivot, which slides along the y-axis. The pendulum itself is taken to be a point mass m which gravity
acts upon. Since the pivot is driven, for instance by some engine or by hand, its vertical position is a given function of time $y_s(t)$.



In order to construct the Lagrange function, we first need to establish what the generalized coordinates are. The mass m can
move in the 2D plane, which naively might suggest that there are 2 degrees of freedom. In reality, however, there is only one
because the length l of the rod is fixed. Thus, the only degree of freedom is the angle $\theta$ and this is our generalized coordinate.
The position of m can then be written as

\[
x = l \sin\theta \quad \text{and} \quad y = y_s(t) - l \cos\theta. \tag{1.63}
\]

The corresponding velocities are

\[
v_x = l \dot{\theta} \cos\theta, \qquad v_y = \dot{y}_s(t) + l \dot{\theta} \sin\theta. \tag{1.64}
\]

We can now write down the kinetic energy T as

\[
T = \frac{1}{2} m (v_x^2 + v_y^2) = \frac{1}{2} m \left[ l^2 \dot{\theta}^2 + \dot{y}_s^2 + 2 l \dot{y}_s \dot{\theta} \sin\theta \right]. \tag{1.65}
\]

The potential energy is $V = mgy = mg(y_s - l \cos\theta)$. The Lagrange equation may now be obtained by using L = T - V:

\[
m l^2 \ddot{\theta} + m l (\ddot{y}_s + g) \sin\theta = 0. \tag{1.66}
\]
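
For readers who wish to follow the algebra, the intermediate steps can be spelled out as follows. These steps are reconstructed here from L = T - V with T and V as given above; they are not quoted from the original notes.

```latex
\frac{\partial L}{\partial \dot{\theta}} = m l^2 \dot{\theta} + m l \dot{y}_s \sin\theta,

\frac{d}{dt} \frac{\partial L}{\partial \dot{\theta}} = m l^2 \ddot{\theta} + m l \ddot{y}_s \sin\theta + m l \dot{y}_s \dot{\theta} \cos\theta,

\frac{\partial L}{\partial \theta} = m l \dot{y}_s \dot{\theta} \cos\theta - m g l \sin\theta .
```

Subtracting the last line from the second, the mixed term proportional to $\dot{y}_s \dot{\theta} \cos\theta$ cancels, leaving the equation of motion for θ.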

Here’s a tip: whenever you derive an equation, pause for a moment and consider it to see if it makes physical sense. What is the 
above equation telling us? 


When deriving an equation, it is often useful to check limiting cases in order to see if we recover a physically sensible result. 


By inspecting the above equation, we see that an interesting interpretation emerges: the equation of motion is identical to that of an undriven pendulum except that gravity $g$ has been replaced by $g + \ddot y_s$. This means that the effective acceleration of the mass is gravity augmented by the acceleration of the pivot itself. This is physically sensible. However, we probably could not have guessed that this would be the case just by looking at the Lagrange function. This is where the power of the non-uniqueness of the Lagrange function comes into play: we can write an alternative Lagrangian which has the same equations of motion, but which is much easier to interpret than the original Lagrange function. Consider thus the following Lagrange function

$$L'(\theta, \dot\theta, t) = \frac{1}{2}ml^2\dot\theta^2 + ml(g + \ddot y_s)\cos\theta. \tag{1.67}$$


With this Lagrange function, it is immediately clear that the accelerating pivot has the net effect of modifying the acceleration due to gravity. The equation of motion obtained from this Lagrange function is identical to equation (1.66), and hence both $L$ and $L'$ give exactly the same physics. For this to be the case, we know that we should be able to write the difference between $L$ and $L'$ as a total time derivative. The difference between the two is:


$$\Delta L = L - L' = \frac{1}{2}m\dot y_s^2 + ml\dot y_s\dot\theta\sin\theta - gmy_s - ml\ddot y_s\cos\theta. \tag{1.68}$$


There are four terms. Two of these terms are independent of both $\theta$ and $\dot\theta$. They depend on time only, so they drop out of the Lagrange equation and have no effect. They can simply be discarded, in the same way that one can always add an arbitrary constant to a Lagrange function without changing the physics (think of this as redefining the potential energy minimum). Now, the two other terms can indeed be rewritten as a total time derivative:


$$ml\dot y_s\dot\theta\sin\theta - ml\ddot y_s\cos\theta = \frac{dF(t,\theta)}{dt} \quad\text{where}\quad F(t,\theta) = -ml\dot y_s\cos\theta. \tag{1.69}$$


We have thus established both mathematically and by physical intuition why the two Lagrangians give the same result. 
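
The total-derivative identity (1.69) can also be checked numerically. The following is a minimal sketch, not part of the original notes: the drive $y_s(t) = 0.1\sin 3t$, the trial path $\theta(t)$ and the values of $m$ and $l$ are arbitrary assumptions chosen only for illustration. It compares the two $\theta$-dependent terms of $L - L'$ with a finite-difference derivative of $F$ along the path.

```python
import math

m, l = 1.0, 0.5                      # illustrative mass and rod length (assumptions)

def ys_dot(t):                       # from the assumed drive y_s(t) = 0.1 sin(3t)
    return 0.3 * math.cos(3.0 * t)

def ys_ddot(t):
    return -0.9 * math.sin(3.0 * t)

def theta(t):                        # arbitrary smooth path, not a solution of (1.66)
    return 0.4 * math.cos(2.0 * t) + 0.1 * t

def theta_dot(t):
    return -0.8 * math.sin(2.0 * t) + 0.1

def F(t):
    """F(t, theta) = -m l dy_s/dt cos(theta), evaluated along the path."""
    return -m * l * ys_dot(t) * math.cos(theta(t))

def delta_L_terms(t):
    """The two theta-dependent terms on the left-hand side of Eq. (1.69)."""
    return (m * l * ys_dot(t) * theta_dot(t) * math.sin(theta(t))
            - m * l * ys_ddot(t) * math.cos(theta(t)))

def dF_dt(t, h=1e-5):
    """Central finite-difference approximation of dF/dt."""
    return (F(t + h) - F(t - h)) / (2.0 * h)

def max_mismatch(times):
    """Largest difference between the two sides of Eq. (1.69) on the given times."""
    return max(abs(delta_L_terms(t) - dF_dt(t)) for t in times)
```

Because the identity holds for any path $\theta(t)$, not only for solutions of the equation of motion, the mismatch should be at the level of the finite-difference error for every sample time.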

II. LAGRANGE'S EQUATIONS AND THE VARIATIONAL PRINCIPLE

Youtube-videos. 05-10 in this playlist. 

Learning goals. After reading this chapter, the student should: 

• Understand the foundation of the variational principle and be able to use it in practical calculations 

• Know how to include non-holonomic constraints in Lagrange’s equations 

• Understand the close relation between symmetries and conservation laws 


A. Hamilton’s principle 

In the previous chapter, we derived Lagrange's equations from a differential principle (d'Alembert's principle) by considering small virtual displacements from a given state. In this chapter we instead derive Lagrange's equations from an integral principle. What this means is that we will consider variations in the motion of the entire system between two times $t_1$ and $t_2$.

Let us clarify what is really meant by the "motion of the entire system". We define the configuration space as spanned by the axes of the $n$ generalized coordinates $\{q_1, q_2, \ldots, q_n\}$ ($n = 3N - k$). The position or state of the system is at any time $t$ given by one point in this configuration space. The motion of the system is thus described by a curve in configuration space where each point on the curve represents the entire system's configuration at a specific time. One advantage of using Hamilton's principle is that we are deriving the dynamics of the system from an expression which depends on the motion of the entire system between times $t_1$ and $t_2$. This makes it convenient to generalize it to quantum mechanics, since in this formulation all possible paths the system can take contribute.

We now define mathematically what Hamilton's principle means. The system will move from time $t_1$ to $t_2$ such that the action $I$ defined as

$$I = \int_{t_1}^{t_2} L\,dt \tag{2.1}$$

has a stationary value (also known as extremal value). Here, $L = L(q, \dot q, t) = T - V$ is the Lagrange function and we have a conservative system if $V = V(q)$. Hamilton's principle is also valid more generally if $V \to U = U(q, \dot q, t)$. Systems described by either $U$ or $V$ are called monogenic. Hamilton's principle may also be expressed as follows:

$$\delta I = \delta\int_{t_1}^{t_2} L(q_1 \ldots q_n, \dot q_1 \ldots \dot q_n, t)\,dt = 0, \tag{2.2}$$

and we shall see that Lagrange's equations follow from Hamilton's principle.





B. Derivation of Lagrange’s equations from Hamilton’s principle 

Assume for simplicity that we just have one degree of freedom $q = q(t)$. The proof below is straightforward to generalize to multiple degrees of freedom $q_i$.

One way to parametrize different curves going from $t_1$ to $t_2$ is by a parameter $\alpha$ such that $\alpha = 0$ corresponds to the stationary value of $I$ (see figure). The action is then written as

$$I(\alpha) = \int_{t_1}^{t_2} L[q(t,\alpha), \dot q(t,\alpha), t]\,dt. \tag{2.3}$$

A virtual variation for a fixed $t$ is then

$$\delta q = \left(\frac{\partial q}{\partial\alpha}\right)_{\alpha=0} d\alpha, \qquad \delta\dot q = \left(\frac{\partial\dot q}{\partial\alpha}\right)_{\alpha=0} d\alpha. \tag{2.4}$$

The variation of $I$ reads:

$$\delta I = \int_{t_1}^{t_2}\left[\frac{\partial L}{\partial q}\delta q + \frac{\partial L}{\partial\dot q}\delta\dot q\right]dt. \tag{2.5}$$

Since the variations are taking place at a fixed time, we may exchange the operations $\delta$ and $d/dt$ so that $\delta\dot q = \frac{d}{dt}\delta q$. In this way, we obtain:





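$$\begin{aligned}
\delta I &= \int_{t_1}^{t_2}\left[\frac{\partial L}{\partial q}\delta q + \frac{\partial L}{\partial\dot q}\frac{d}{dt}\delta q\right]dt \\
&= \int_{t_1}^{t_2}\left[\frac{\partial L}{\partial q} - \frac{d}{dt}\frac{\partial L}{\partial\dot q}\right]\delta q\,dt + \left[\frac{\partial L}{\partial\dot q}\delta q\right]_{t_1}^{t_2} = 0.
\end{aligned} \tag{2.6}$$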


The "surface term" (second term in the last line of the above equation) vanishes as there is no variation at the end points $t = t_1$ and $t = t_2$. Since the path taken by the system is determined by $\delta I = 0$, and since $\delta q$ is arbitrary, it follows that the integrand itself must be zero:


$$\frac{d}{dt}\frac{\partial L}{\partial\dot q} - \frac{\partial L}{\partial q} = 0. \tag{2.7}$$


We have thus recovered Lagrange's equations. Generalized to multiple degrees of freedom, the derivation above gives the same result with $q \to q_i$, $i = 1, 2, \ldots, n$. We underline that the generalized coordinates have to be independent, i.e. we are using holonomic constraints. The above procedure is valid for conservative systems [where $V = V(q_i)$] and for non-conservative systems when $Q_i = -\frac{\partial U}{\partial q_i} + \frac{d}{dt}\frac{\partial U}{\partial\dot q_i}$ with $L = T - U$.
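
Hamilton's principle can be illustrated numerically. The sketch below is an illustration under stated assumptions, not part of the original notes: it takes a particle in a uniform gravitational field, $L = \frac{1}{2}m\dot q^2 - mgq$, whose true path between $q(0) = 0$ and $q(T) = 0$ is $q(t) = v_0 t - \frac{1}{2}gt^2$ with $v_0 = gT/2$. The path is deformed by $\epsilon\sin(\pi t/T)$, which vanishes at both end points, and the action is evaluated with a midpoint rule. The values of $m$, $g$ and $T$ are arbitrary.

```python
import math

m, g, T = 1.0, 9.81, 1.0      # illustrative values (assumptions)
v0 = 0.5 * g * T              # chosen so that q(0) = q(T) = 0 on the true path

def action(eps, n=20000):
    """Midpoint-rule action for q(t) = v0 t - g t^2 / 2 + eps sin(pi t / T)."""
    dt = T / n
    total = 0.0
    for i in range(n):
        t = (i + 0.5) * dt
        q = v0 * t - 0.5 * g * t * t + eps * math.sin(math.pi * t / T)
        q_dot = v0 - g * t + eps * (math.pi / T) * math.cos(math.pi * t / T)
        total += (0.5 * m * q_dot ** 2 - m * g * q) * dt
    return total

def action_change(eps):
    """I(eps) - I(0): the change of the action caused by the deformation."""
    return action(eps) - action(0.0)
```

For this Lagrangian the first-order change vanishes and the action changes only at second order, $I(\epsilon) - I(0) = \epsilon^2 m\pi^2/(4T)$, so `action_change(eps)` and `action_change(-eps)` should agree. That is what a stationary value at $\epsilon = 0$ means.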


C. Variational calculus 

The idea of finding the extremal value of an integral has practical use beyond the derivation of the Lagrange equations above. Let's say that we are interested in finding the extremal values of the integral

$$I = \int_{x_1}^{x_2} f(y, y', x)\,dx \tag{2.8}$$

where $f(y, y', x)$ is a function defined on the curve $y(x)$. Here, $y'$ means $dy/dx$. The task is then to find the curve $y(x)$ which gives $I$ its extremal value, i.e. $\delta I = 0$. Now, we can simply reuse our result from the previous subsection, with $t \to x$, $q \to y$ and $L \to f$. A curve $y(x)$ which satisfies the following equation will give $I$ an extremal value:

$$\frac{\partial f}{\partial y} - \frac{d}{dx}\frac{\partial f}{\partial y'} = 0. \tag{2.9}$$

In this context (variational calculus), these equations are known as the Euler equations or Euler-Lagrange equations. In practice, this would be a relevant problem to solve for instance if one wishes to minimize some quantity $I$ with respect to a function $y(x)$. To be concrete, $I$ could be the amount of paint one has to use to paint a surface characterized by the function $y(x)$. Let's use this as an example.


Example 6. Minimize surface area of an object. Consider a curve between two fixed points $(x_1, y_1)$ and $(x_2, y_2)$. Now revolve this curve around the $y$-axis to produce a surface (see figure). Our task is then to find the curve $y(x)$ which gives the minimal area of the surface of revolution.

First, we need to find an expression for the area $A$ of the surface. The area of the strip generated by the line element $ds$ is given as $2\pi x\,ds = 2\pi x\sqrt{dx^2 + dy^2} = 2\pi x\sqrt{1 + (y')^2}\,dx$. The total area is then given by integrating this from $x_1$ to $x_2$:

$$A = \int_{x_1}^{x_2} 2\pi x\sqrt{1 + (y')^2}\,dx = 2\pi\int_{x_1}^{x_2} f(y, y', x)\,dx. \tag{2.10}$$

We defined $f(y, y', x) = x\sqrt{1 + (y')^2}$. We can now use the Euler equations to find the curve $y(x)$ which makes $A$ have an extremal value. Plugging $f$ into equation (2.9), we get


$$\frac{d}{dx}\left(\frac{xy'}{\sqrt{1 + (y')^2}}\right) = 0. \tag{2.11}$$


This equation may be solved for $y$ as follows. The expression inside the parenthesis has to be a constant, let's call it $a$. Rewriting the expression in terms of $y'$, we get

$$y' = \frac{a}{\sqrt{x^2 - a^2}}. \tag{2.12}$$

Integrating with respect to $x$ gives the solution

$$y = a\,\mathrm{arccosh}(x/a) + b, \tag{2.13}$$

where $b$ is the integration constant. Alternatively, we have $x = a\cosh[(y - b)/a]$. The boundary conditions $y(x_1) = y_1$ and $y(x_2) = y_2$ will then determine the coefficients $a$ and $b$.
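
As a sanity check of Example 6, one can verify numerically that the curve (2.13) keeps the quantity in the parenthesis of (2.11) constant, and that it gives a smaller area than a straight line between the same end points. This is a minimal sketch: the constants $a = 1$, $b = 0$ and the interval $[1.5, 3]$ are arbitrary choices, and all function names are hypothetical.

```python
import math

def y_curve(x, a, b):
    """Candidate extremal y(x) = a arccosh(x / a) + b from Eq. (2.13), for x > a."""
    return a * math.acosh(x / a) + b

def first_integral(x, a, b, h=1e-6):
    """x y' / sqrt(1 + y'^2), with y' from a central finite difference."""
    y_prime = (y_curve(x + h, a, b) - y_curve(x - h, a, b)) / (2.0 * h)
    return x * y_prime / math.sqrt(1.0 + y_prime ** 2)

def area(y_prime, x1, x2, n=20000):
    """Midpoint-rule value of A = 2 pi * integral of x sqrt(1 + y'^2) dx, Eq. (2.10)."""
    dx = (x2 - x1) / n
    total = 0.0
    for i in range(n):
        x = x1 + (i + 0.5) * dx
        total += x * math.sqrt(1.0 + y_prime(x) ** 2) * dx
    return 2.0 * math.pi * total

a, b, x1, x2 = 1.0, 0.0, 1.5, 3.0
slope = (y_curve(x2, a, b) - y_curve(x1, a, b)) / (x2 - x1)

area_extremal = area(lambda x: a / math.sqrt(x * x - a * a), x1, x2)  # y' from Eq. (2.12)
area_straight = area(lambda x: slope, x1, x2)                          # same end points
```

Here `first_integral` should return a value close to $a$ for every $x$ in the interval, and `area_extremal` should come out smaller than `area_straight`.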


D. Hamilton’s principle for non-holonomic systems 


Up to now, we have mainly considered holonomic constraints. Recall that such constraints may be written mathematically as $f(\mathbf{r}_1, \mathbf{r}_2, \ldots, t) = 0$. Thus, with $j$ holonomic constraints we were able to introduce $n = 3N - j$ generalized coordinates which were all independent. With $n$ generalized coordinates and using our treatment above, Hamilton's principle becomes:

$$\int_{t_1}^{t_2} dt\sum_{k=1}^{n}\left(\frac{\partial L}{\partial q_k} - \frac{d}{dt}\frac{\partial L}{\partial\dot q_k}\right)\delta q_k = 0. \tag{2.14}$$

The argument which allowed us to set the integrand, rather than the entire integral, to zero was that all $\delta q_k$ are independent. Thus, one arrives at Lagrange's equations

$$\frac{d}{dt}\frac{\partial L}{\partial\dot q_k} - \frac{\partial L}{\partial q_k} = 0, \quad k = 1, 2, \ldots, n. \tag{2.15}$$


However, we are now considering non-holonomic systems and the main difference is that not all $\delta q_k$ are now independent. To be concrete, let us assume that we have $m$ constraints $f(q_i, \dot q_i, t) = 0$ (note the dependence on $\dot q_i$ which makes the constraint non-holonomic) of the rather general form:

$$a_{l,t}\,dt + \sum_{k=1}^{n} a_{l,k}\,dq_k = 0, \quad l = 1, 2, \ldots, m. \tag{2.16}$$

By dividing the entire equation by $dt$, we recast it into a form which depends on $\dot q_k$. The coefficients $a_{l,k}$ and $a_{l,t}$ are allowed to depend on the generalized coordinates $q$ and time $t$, making the constraint above quite general. Any virtual displacements of the generalized coordinates $\delta q_k$ have to be in accordance with the constraints

$$\sum_{k=1}^{n} a_{l,k}\,\delta q_k = 0. \tag{2.17}$$

The reason that $dt$ does not enter the expression for the virtual displacement, although it enters the constraint, is that, as previously mentioned, virtual displacements take place at a fixed time $t$. In order to solve the problem, we will now make use of Lagrange's method of undetermined multipliers. Recall that we have $m$ equations (2.17) describing virtual displacements which are in accordance with the constraints of the system. Now multiply each of these equations with a coefficient $\lambda_l = \lambda_l(q, t)$:

$$\lambda_l\sum_{k=1}^{n} a_{l,k}\,\delta q_k = 0 \quad\rightarrow\quad \int_{t_1}^{t_2} dt\sum_{l=1}^{m}\sum_{k=1}^{n}\lambda_l a_{l,k}\,\delta q_k = 0. \tag{2.18}$$


We can now combine this equation with Hamilton's principle derived previously, but with the important caveat that we can no longer use that the virtual displacements $\delta q_k$ are independent of each other, due to the non-holonomic constraints. We thus have

$$\int_{t_1}^{t_2} dt\sum_{k=1}^{n}\left(\frac{\partial L}{\partial q_k} - \frac{d}{dt}\frac{\partial L}{\partial\dot q_k} + \sum_{l=1}^{m}\lambda_l a_{l,k}\right)\delta q_k = 0. \tag{2.19}$$


We cannot immediately set the integrand (the term inside the parenthesis) to zero, since the $\delta q_k$ are not independent of each other. There are a lot of indices in play now, so let's clarify them a bit. We have in total $n$ generalized coordinates. We also have $m < n$ non-holonomic constraints. This means that $n - m$ of the virtual displacements are independent of each other, whereas $m$ of them do depend on each other via the relation in equation (2.17).

Here is where we make use of the undetermined multipliers $\lambda_l$ that we introduced. We have not specified them so far, but at this point it is useful to do so. In fact, let us choose them so that the equation


$$\frac{\partial L}{\partial q_k} - \frac{d}{dt}\frac{\partial L}{\partial\dot q_k} + \sum_{l=1}^{m}\lambda_l a_{l,k} = 0 \tag{2.20}$$


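is satisfied for $k = n - m + 1, \ldots, n$. In that case, equation (2.19) reduces to

$$\int_{t_1}^{t_2} dt\sum_{k=1}^{n-m}\left(\frac{\partial L}{\partial q_k} - \frac{d}{dt}\frac{\partial L}{\partial\dot q_k} + \sum_{l=1}^{m}\lambda_l a_{l,k}\right)\delta q_k = 0. \tag{2.21}$$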

Now, we are left with only the virtual displacements $\delta q_k$ which are independent of each other, since the sum runs from 1 to $n - m$, and we are allowed to set the integrand to zero for all $k = 1, 2, \ldots, n - m$. In total, we then have:

$$\frac{\partial L}{\partial q_k} - \frac{d}{dt}\frac{\partial L}{\partial\dot q_k} + \sum_{l=1}^{m}\lambda_l a_{l,k} = 0, \quad k = 1, 2, \ldots, n. \tag{2.22}$$

These are the Lagrange equations which take into account the non-holonomic constraints. We have $n + m$ unknowns: $n$ generalized coordinates $q_1, \ldots, q_n$ and $m$ Lagrange multipliers $\lambda_1, \ldots, \lambda_m$. To find these unknowns, we then have $n$ Lagrange equations and $m$ constraints:

$$\sum_{k=1}^{n} a_{l,k}\,\dot q_k + a_{l,t} = 0, \quad l = 1, 2, \ldots, m. \tag{2.23}$$

We now show an example of how to use the Lagrange multiplier method in practice. Note that this technique can be used even when the constraints are holonomic, which makes it quite versatile.


Example 7. Ring rolling on an inclined plane. 



Initially, it looks like we have two generalized coordinates $x$ and $\theta$. However, we also have a constraint present if the ring is supposed to always roll and never slide. The constraint is that the length $r\,d\theta$ of the edge segment that touches the ground has to be equal to the distance $dx$ along the inclined plane itself. By dividing $r\,d\theta = dx$ by $dt$ on both sides, we see that this constraint has the same form as we considered in the previous subsection, namely

$$\sum_{k=1}^{2} a_k\dot q_k + a_t = a_x\dot x + a_\theta\dot\theta + a_t = 0, \tag{2.24}$$

if we set $a_x = 1$, $a_\theta = -r$, and $a_t = 0$. To proceed, we also need the Lagrange function $L = T - V$. According to the figure, we see that

$$T = \frac{1}{2}M\dot x^2 + \frac{1}{2}Mr^2\dot\theta^2, \qquad V = Mg(l - x)\sin\phi. \tag{2.25}$$

We defined $l$ as the total length of the plane. We can now obtain Lagrange's equations for $x$ and $\theta$:

$$Mg\sin\phi - M\ddot x + \lambda = 0 \quad\text{and}\quad Mr\ddot\theta = -\lambda. \tag{2.26}$$

Together with the constraint $\dot x = r\dot\theta$, we now have three equations and three unknowns $(x, \theta, \lambda)$. They are readily solved as follows:

$$\ddot x = \frac{g\sin\phi}{2}, \qquad \ddot\theta = \frac{g\sin\phi}{2r}, \qquad \lambda = -\frac{Mg\sin\phi}{2}. \tag{2.27}$$

The acceleration when the ring rolls is seen to be half of the acceleration it would have if it were to slide down the inclined plane without friction. In that case, we would have $\ddot x = g\sin\phi$ due to the gravitational acceleration. The reason why the acceleration is now half as large is that the potential energy at the top of the plane has to be converted not only to kinetic energy for translational motion down the plane, but also to kinetic energy associated with rotational motion. The velocity of the ring at the bottom of the plane, for a ring starting from rest at the top, is found by integrating $\ddot x = \frac{1}{2}g\sin\phi$, yielding $v = \sqrt{gl\sin\phi}$.
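
The three linear equations of Example 7 can also be solved mechanically, which is useful when there are more coordinates and multipliers. The sketch below is an illustration rather than the authors' method: it writes Eq. (2.26) together with the differentiated constraint $\ddot x = r\ddot\theta$ as a $3\times 3$ linear system in the unknowns $(\ddot x, \ddot\theta, \lambda)$ and solves it with a small Gaussian elimination. The helper names and the numerical values of $M$, $r$ and $\phi$ are assumptions.

```python
import math

def solve_linear(A, b):
    """Gaussian elimination with partial pivoting for a small dense system A x = b."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]   # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):                        # back substitution
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x

def rolling_ring(mass, r, g, phi):
    """Return (x_ddot, theta_ddot, lam) for the ring rolling down the incline."""
    A = [[mass, 0.0, -1.0],        # M x_ddot - lam = M g sin(phi)
         [0.0, mass * r, 1.0],     # M r theta_ddot + lam = 0
         [1.0, -r, 0.0]]           # x_ddot - r theta_ddot = 0
    b = [mass * g * math.sin(phi), 0.0, 0.0]
    return solve_linear(A, b)

x_ddot, theta_ddot, lam = rolling_ring(mass=2.0, r=0.3, g=9.81, phi=math.radians(30.0))
```

The result can be compared directly with Eq. (2.27). The multiplier $\lambda$ comes out negative, consistent with a friction force pointing up the plane.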


The variational principle that we have discussed, whether in its differential or global form, has several advantages.

• It is most useful when one can find the Lagrange function expressed via independent coordinates (thus, for holonomic systems).

• This method involves only $T$ and $V$, which are physical quantities that are independent of the choice of coordinates. The whole formalism is thus invariant with respect to the choice of coordinates.

• The framework used above can be employed in several branches of physics. Consider for instance the following Lagrange function:

$$L = \frac{1}{2}\sum_j \mathcal{L}_j\dot q_j^2 + \frac{1}{2}\sum_{j,k;\,j\neq k} M_{jk}\dot q_j\dot q_k - \sum_j\frac{q_j^2}{2C_j} + \sum_j E_j(t)\,q_j \tag{2.28}$$

with the dissipation function $\mathcal{F} = \frac{1}{2}\sum_j R_j\dot q_j^2$. The resulting Lagrange equations read

$$\mathcal{L}_j\ddot q_j + \sum_{k\neq j} M_{jk}\ddot q_k + R_j\dot q_j + \frac{q_j}{C_j} = E_j(t) \tag{2.29}$$

and can be used to describe systems as diverse as 1) a system of electrical circuits coupled via mutual inductances $M_{jk}$, in which $q$ above denotes the electric charges, and 2) a system of masses and springs moving in a viscous medium, where $q$ now denotes the positions.


E. Conservation laws and symmetries 

If the system we are considering has in total $n$ degrees of freedom, there will be $n$ second order differential equations that constitute the equations of motion. A complete solution would thus require 2 integrations per equation, leading to $2n$ integration constants that must be determined from the initial conditions (start-values for $q_1, \ldots, q_n, \dot q_1, \ldots, \dot q_n$).

However, in many scenarios we are not necessarily interested in the exact solution for $q_j(t)$ for every $j = 1, \ldots, n$. Instead, it can be more convenient to describe the nature of the system's motion in terms of conservation laws and symmetries.

To illustrate this, consider a system consisting of point masses moving in a potential $V$ that only depends on position (i.e. a conservative potential). We then have:

$$\frac{\partial L}{\partial\dot x_i} = \frac{\partial T}{\partial\dot x_i} - \frac{\partial V}{\partial\dot x_i} = \frac{\partial T}{\partial\dot x_i} = m_i\dot x_i = p_{i,x}. \tag{2.30}$$

Note that summation over $i$ is not implied in the second-last term in the above equation. With generalized coordinates we define the canonical momentum as

$$p_i \equiv \frac{\partial L}{\partial\dot q_i}.$$

From this definition, we see that if the potential depends on velocity, the canonical momentum will be different from the mechanical momentum.

Example 8. Particles in an electromagnetic field. As we have seen previously in this book, the appropriate Lagrangian for such a system reads

$$L = \sum_i\frac{1}{2}m_i\dot{\mathbf{r}}_i^2 - \sum_i q_i\phi(\mathbf{r}_i) + \sum_i q_i\mathbf{A}(\mathbf{r}_i)\cdot\dot{\mathbf{r}}_i. \tag{2.31}$$

From our definition of the canonical momentum, we find

$$p_{i,x} = \frac{\partial L}{\partial\dot x_i} = m_i\dot x_i + q_iA_x \neq m_i\dot x_i. \tag{2.32}$$


Another very useful concept in the context of symmetries and conservation laws is cyclic coordinates:

A coordinate $q_i$ is cyclic if $L$ does not contain $q_i$. The corresponding canonical momentum $p_i$ is then constant.

To see this, note that $\partial L/\partial q_i = 0$ if $q_i$ is cyclic, which follows from its definition. From the Lagrange equation, we then have

$$\frac{d}{dt}\frac{\partial L}{\partial\dot q_i} = \frac{d}{dt}p_i = \dot p_i = 0, \tag{2.33}$$

so that $p_i$ must be constant. Looking back on the above example with particles moving in an electromagnetic field, we see that if the scalar potential $\phi$ and vector potential $\mathbf{A}$ are both independent of $x$, then $L$ is independent of $x$ and $x$ is a cyclic coordinate. The canonical momentum $p_x = m\dot x + qA_x$ is then constant, while the mechanical momentum $m\dot x$ is in general not conserved.
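
The difference between canonical and mechanical momentum can be made concrete with a short simulation. This is a sketch under stated assumptions: a single charge moves in the $xy$-plane in a uniform magnetic field $\mathbf{B} = B\hat{\mathbf{z}}$ with $\phi = 0$, and the gauge $\mathbf{A} = (-By, 0, 0)$ is chosen so that $x$ is cyclic. The values of $q$, $m$, $B$ and the initial velocity are arbitrary. The equations of motion $m\dot{\mathbf{v}} = q\mathbf{v}\times\mathbf{B}$ are integrated with a classical fourth-order Runge-Kutta step.

```python
q, m, B = 1.0, 1.0, 2.0     # illustrative charge, mass and field strength (assumptions)

def rhs(state):
    """Time derivative of (x, y, vx, vy) under the Lorentz force for B along z."""
    x, y, vx, vy = state
    return (vx, vy, q * B * vy / m, -q * B * vx / m)

def rk4_step(state, dt):
    """One classical Runge-Kutta step."""
    def shifted(s, k, f):
        return tuple(si + f * ki for si, ki in zip(s, k))
    k1 = rhs(state)
    k2 = rhs(shifted(state, k1, dt / 2))
    k3 = rhs(shifted(state, k2, dt / 2))
    k4 = rhs(shifted(state, k3, dt))
    return tuple(s + dt / 6 * (a + 2 * b + 2 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

def canonical_px(state):
    """p_x = m x_dot + q A_x with the assumed gauge A = (-B y, 0, 0)."""
    x, y, vx, vy = state
    return m * vx - q * B * y

def run(steps=2000, dt=1e-3):
    """Integrate from rest position with initial velocity along x; return all states."""
    state = (0.0, 0.0, 1.0, 0.0)
    history = [state]
    for _ in range(steps):
        state = rk4_step(state, dt)
        history.append(state)
    return history

history = run()
px_values = [canonical_px(s) for s in history]
mech_values = [m * s[2] for s in history]
```

Along the circular orbit `px_values` stays constant to numerical precision, whereas `mech_values` oscillates as the velocity rotates.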

What we have seen so far is that there appears to be a relation between a symmetry (meaning an operation which leaves $L$ invariant) and a conservation law (a quantity which remains constant). In other words, when the system has a symmetry, something is often conserved. This is not a coincidental relation, but actually a very profound result in theoretical physics known as Noether's theorem:

If a system has a continuous symmetry, there exists a quantity whose value is time-independent.

It is important to emphasize here that this holds for continuous symmetries, and not necessarily discrete symmetries (such as reflection $\mathbf{r} \to -\mathbf{r}$).

We will now examine more closely which conservation laws arise in the context of translational and rotational symmetries. Finally, we will also discuss the symmetry which leads to the pivotal energy conservation law (can you already now guess which symmetry this is?)

Let us start off with translational symmetry. Consider a generalized coordinate $q_j$ which is defined so that $dq_j$ means translation of the entire system in a direction $\mathbf{n}$.



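From the figure, it is clear that for all $i$ we have:

$$\frac{\partial\mathbf{r}_i}{\partial q_j} = \lim_{dq_j\to 0}\frac{\mathbf{r}_i(q_j + dq_j) - \mathbf{r}_i(q_j)}{dq_j} = \frac{dq_j\,\mathbf{n}}{dq_j} = \mathbf{n}. \tag{2.34}$$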


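Assume that we have a conservative system, $V = V(q)$. The Lagrange equation

$$\frac{d}{dt}\frac{\partial T}{\partial\dot q_j} - \frac{\partial T}{\partial q_j} = Q_j \tag{2.35}$$

holds in general for a holonomic system, as discussed earlier. Velocities, and thus $T$, are not affected by moving the origin, so that $\partial T/\partial q_j = 0$. It follows, by using the definition of the generalized force $Q_j$:

$$\dot p_j = Q_j = \sum_i\mathbf{F}_i\cdot\frac{\partial\mathbf{r}_i}{\partial q_j} = \sum_i\mathbf{F}_i\cdot\mathbf{n} = \mathbf{F}\cdot\mathbf{n}. \tag{2.36}$$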


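In other words, $Q_j$ is the component of the total force $\mathbf{F}$ along the direction $\mathbf{n}$. The canonical momentum may be computed as follows:

$$p_j = \frac{\partial T}{\partial\dot q_j} = \sum_i m_i\mathbf{v}_i\cdot\frac{\partial\mathbf{v}_i}{\partial\dot q_j} = \sum_i m_i\mathbf{v}_i\cdot\frac{\partial\mathbf{r}_i}{\partial q_j} = \sum_i m_i\mathbf{v}_i\cdot\mathbf{n} = \mathbf{P}\cdot\mathbf{n}. \tag{2.37}$$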


Thus, $p_j$ is the component of the total linear momentum $\mathbf{P}$ along $\mathbf{n}$. If $q_j$ is cyclic, then $\partial L/\partial q_j = 0$ and $p_j$ is conserved.


Next, we consider rotational symmetry. Consider a generalized coordinate $q_j$ which is such that $dq_j$ means a rotation of the entire system around an axis $\mathbf{n}$. Using the same arguments as above, we again find that $\dot p_j = Q_j$ with $Q_j = \sum_i\mathbf{F}_i\cdot\partial\mathbf{r}_i/\partial q_j$.
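
From the figure, we see that

$$|d\mathbf{r}_i| = r_i\sin\theta\,dq_j \quad\rightarrow\quad \left|\frac{\partial\mathbf{r}_i}{\partial q_j}\right| = r_i\sin\theta, \tag{2.38}$$

where the vector $\partial\mathbf{r}_i/\partial q_j$ has a direction perpendicular to both $\mathbf{r}_i$ and $\mathbf{n}$. Thus, we have

$$\frac{\partial\mathbf{r}_i}{\partial q_j} = \mathbf{n}\times\mathbf{r}_i. \tag{2.39}$$

Using these relations, we may compute the generalized force

$$Q_j = \sum_i\mathbf{F}_i\cdot(\mathbf{n}\times\mathbf{r}_i) = \sum_i\mathbf{n}\cdot(\mathbf{r}_i\times\mathbf{F}_i) = \mathbf{n}\cdot\boldsymbol{\tau}. \tag{2.40}$$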


Here, $\boldsymbol{\tau}$ is the total torque acting on the system. In the same way, we identify

$$p_j = \sum_i m_i\mathbf{v}_i\cdot(\mathbf{n}\times\mathbf{r}_i) = \mathbf{n}\cdot\sum_i\mathbf{r}_i\times m_i\mathbf{v}_i = \mathbf{n}\cdot\mathbf{L}. \tag{2.41}$$

Summarizing so far, $Q_j$ is the component of the total torque along $\mathbf{n}$ and $p_j$ is the component of the angular momentum $\mathbf{L}$ along $\mathbf{n}$. If $q_j$ is cyclic, it follows that $Q_j = 0$ so that $p_j$ is conserved. In other words, when the system is invariant under rotation around an axis, the component of the angular momentum along that axis is conserved.


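Finally, we consider conservation of energy. This is a conservation law that is often taken for granted, but there is actually a specific requirement that must be fulfilled in order for energy to be conserved. Assume that we have a Lagrange function $L = L(q_i, \dot q_i, t)$ with a potential $V = V(q_i)$. Now the total time derivative is:

$$\begin{aligned}
\frac{dL}{dt} &= \sum_i\frac{\partial L}{\partial q_i}\frac{dq_i}{dt} + \sum_i\frac{\partial L}{\partial\dot q_i}\frac{d\dot q_i}{dt} + \frac{\partial L}{\partial t} \\
&= \sum_i\frac{d}{dt}\left(\frac{\partial L}{\partial\dot q_i}\right)\dot q_i + \sum_i\frac{\partial L}{\partial\dot q_i}\frac{d\dot q_i}{dt} + \frac{\partial L}{\partial t} \\
&= \frac{d}{dt}\sum_i\dot q_i\frac{\partial L}{\partial\dot q_i} + \frac{\partial L}{\partial t},
\end{aligned} \tag{2.42}$$

where Lagrange's equations were used in the second line. Moving the left-hand side over to the right side, we get the equation

$$\frac{dH}{dt} + \frac{\partial L}{\partial t} = 0 \tag{2.43}$$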

where we defined the energy function (or the Hamilton function)

$$H = \sum_i\dot q_i\frac{\partial L}{\partial\dot q_i} - L. \tag{2.44}$$

It follows that if $L$ has no explicit time-dependence, i.e. $\partial L/\partial t = 0$, then $dH/dt = 0$. To proceed, we note the mathematical property that for a homogeneous function $f$ of the $n$-th degree, Euler's theorem dictates:

$$\sum_i x_i\frac{\partial f}{\partial x_i} = n\cdot f. \tag{2.45}$$

It so happens that the kinetic energy $T$ is a homogeneous function of the 2nd degree in the velocities, meaning that

$$\sum_i\dot q_i\frac{\partial L}{\partial\dot q_i} = \sum_i\dot q_i\frac{\partial T}{\partial\dot q_i} = 2T. \tag{2.46}$$

To get this result, we used that $\partial L/\partial\dot q_i = \partial T/\partial\dot q_i$ since the system is conservative. Using the above result, we then have

$$H = 2T - L = 2T - (T - V) = T + V = \text{total energy}. \tag{2.47}$$


In other words:


If the Lagrange function has no explicit time-dependence, $\partial L/\partial t = 0$, then the total energy of the system is conserved.
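
Euler's theorem for the kinetic energy, Eq. (2.46), is easy to test numerically. The sketch below is an illustration only: it assumes the kinetic energy of a particle in plane polar coordinates, $T = \frac{1}{2}m(\dot r^2 + r^2\dot\theta^2)$, and evaluates $\sum_i\dot q_i\,\partial T/\partial\dot q_i$ with central finite differences. The mass and the sample point are arbitrary.

```python
def kinetic(r, r_dot, theta_dot, m=1.5):
    """T = m (r_dot^2 + r^2 theta_dot^2) / 2, quadratic in the velocities."""
    return 0.5 * m * (r_dot ** 2 + r ** 2 * theta_dot ** 2)

def euler_sum(r, r_dot, theta_dot, h=1e-6):
    """sum_i q_dot_i dT/dq_dot_i, with the derivatives from central differences."""
    dT_dr_dot = (kinetic(r, r_dot + h, theta_dot)
                 - kinetic(r, r_dot - h, theta_dot)) / (2.0 * h)
    dT_dtheta_dot = (kinetic(r, r_dot, theta_dot + h)
                     - kinetic(r, r_dot, theta_dot - h)) / (2.0 * h)
    return r_dot * dT_dr_dot + theta_dot * dT_dtheta_dot

lhs = euler_sum(2.0, 0.7, -1.3)
rhs_value = 2.0 * kinetic(2.0, 0.7, -1.3)
```

The two numbers `lhs` and `rhs_value` should agree up to finite-difference round-off, which is the statement $\sum_i\dot q_i\,\partial T/\partial\dot q_i = 2T$.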



III. HAMILTON'S EQUATIONS

Youtube-videos. 11-12 in this playlist. 

Learning goals. After reading this chapter, the student should: 

• Understand how Hamilton’s and Lagrange’s equations are related and obtained from one another. 

• Know how to construct and solve Hamilton’s equations for simple model systems. 


A. Legendre transformations 

Upon introducing Hamilton's equations in this chapter, we emphasize right away that these are equivalent to Lagrange's equations: there is no new physics involved, just a new method or technique. In terms of directly solving problems in mechanics, Hamilton's equations are not better or worse than the Lagrange formalism. However, the Hamiltonian framework is more suitable in other areas of physics, including quantum mechanics and statistical mechanics. In what follows, we shall consider holonomic systems with monogenic forces. According to our previous definition of these concepts, we then have $V = V(q)$ or $U = U(q, \dot q)$ (the latter as in the case of an electromagnetic field). Even with these restrictions, the following analysis remains valid for a vast number of physical situations.

Let us first briefly recap. The Lagrange formulation may be stated as follows: 

With $n$ degrees of freedom, we have $\frac{d}{dt}\frac{\partial L}{\partial\dot q_i} - \frac{\partial L}{\partial q_i} = 0$, $i = 1, 2, \ldots, n$. We thus have $n$ second order differential equations: a complete solution requires $2n$ initial conditions, such as the values of $q_i$ and $\dot q_i$ at one time $t_1$ or alternatively the values of $q_i$ at two times $t_1$ and $t_2$. The state of the system is specified by a point in the $n$-dimensional configuration space with axes $q_i$.


Instead, the Hamilton formulation may be summarized as follows:


With $n$ degrees of freedom, we have $2n$ first order differential equations. We thus still need $2n$ initial conditions. The state of the system is specified by a point in the $2n$-dimensional phase space with axes $q_i$ and $p_i$ where

$$p_i = \frac{\partial L}{\partial\dot q_i}, \quad i = 1, 2, \ldots, n. \tag{3.1}$$
The quantities q and p are known as canonical variables. 


From a mathematical perspective, the transition from Lagrange to Hamilton formulation requires that we change the variables in our functions from $(q, \dot q, t)$ to $(q, p, t)$ where $p = \partial L/\partial\dot q$. There actually exists a specific recipe to accomplish such a change in variables, known as the Legendre transformation, which we now review.

Assume that we have a function $f(x, y)$ such that

$$df = u\,dx + v\,dy \tag{3.2}$$

with $u = \partial f/\partial x$ and $v = \partial f/\partial y$. We now wish to change variables from $(x, y)$ to $(u, y)$ so that differentials are expressed via $du$ and $dy$. Let us define

$$g = f - ux. \tag{3.3}$$

It is no coincidence that the new function is defined as the old one minus the product between the variables that we want to interchange, namely $u$ and $x$ in this particular case. In this way, we see that

$$dg = df - u\,dx - x\,du = v\,dy - x\,du. \tag{3.4}$$




This is the desired form. Since we also have generically $dg = (\partial g/\partial u)\,du + (\partial g/\partial y)\,dy$, it follows that

$$x = -\partial g/\partial u, \qquad v = \partial g/\partial y. \tag{3.5}$$
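
The relations (3.5) can be tested on a concrete function. The following sketch is only an illustration: it assumes the test function $f(x, y) = x^2y + y^3$, for which $u = \partial f/\partial x = 2xy$ can be inverted as $x = u/(2y)$ when $y \neq 0$. It builds $g(u, y) = f - ux$ and differentiates it numerically. The helper names are hypothetical.

```python
def f(x, y):
    """Assumed test function f(x, y) = x^2 y + y^3."""
    return x * x * y + y ** 3

def x_of(u, y):
    """Inverts u = df/dx = 2 x y for x (requires y != 0)."""
    return u / (2.0 * y)

def g(u, y):
    """Legendre transform g = f - u x of Eq. (3.3), expressed in (u, y)."""
    x = x_of(u, y)
    return f(x, y) - u * x

def partial(func, args, index, h=1e-6):
    """Central finite-difference partial derivative with respect to args[index]."""
    up, down = list(args), list(args)
    up[index] += h
    down[index] -= h
    return (func(*up) - func(*down)) / (2.0 * h)

x0, y0 = 0.8, 1.7
u0 = 2.0 * x0 * y0                       # u = df/dx at the sample point
v0 = partial(f, (x0, y0), 1)             # v = df/dy at the sample point
dg_du = partial(g, (u0, y0), 0)
dg_dy = partial(g, (u0, y0), 1)
```

With these definitions `dg_du` should reproduce $-x_0$ and `dg_dy` should reproduce $v_0$, in line with Eq. (3.5).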

The Legendre transformation is commonly used in thermodynamics. Let’s have a look at an example. 


Example 9. Use of Legendre transformation in thermodynamics. Enthalpy $H$ (not to be confused with the Hamiltonian) is a function of entropy $S$ and pressure $p$ in the following way:

$$\partial H/\partial S = T, \qquad \partial H/\partial p = V. \tag{3.6}$$

The enthalpy $H = H(S, p)$ is useful in particular for isentropic and isobaric processes since it remains constant. However, if one instead is interested in describing isothermal and isobaric processes, it is more convenient to use a function depending on $T$ and $p$. We now know how to accomplish this: via a Legendre transformation. The new function is supposed to be the old one minus the product of the two variables we wish to exchange, $S$ and $T$ in this case. We thus define

$$G = H - TS \tag{3.7}$$

so that

$$dG = dH - T\,dS - S\,dT = T\,dS + V\,dp - T\,dS - S\,dT = V\,dp - S\,dT. \tag{3.8}$$

Here, $G$ is the Gibbs free energy.
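
B. Going from Lagrangian to Hamiltonian formalism


The natural Legendre transformation for going from the Lagrange to the Hamilton formalism is then to take the difference between the product of the variables to be exchanged and the old function:

$$H = H(q, p, t) = \sum_i p_i\dot q_i - L. \tag{3.9}$$

There are two ways to express the differential $dH$. Generically, we have

$$dH = \sum_i\frac{\partial H}{\partial q_i}dq_i + \sum_i\frac{\partial H}{\partial p_i}dp_i + \frac{\partial H}{\partial t}dt, \tag{3.10}$$

while from Eq. (3.9) we get

$$dH = \sum_i\left(\dot q_i\,dp_i + p_i\,d\dot q_i - \frac{\partial L}{\partial q_i}dq_i - \frac{\partial L}{\partial\dot q_i}d\dot q_i\right) - \frac{\partial L}{\partial t}dt. \tag{3.11}$$

Since $p_i = \partial L/\partial\dot q_i$ and, from Lagrange's equations, $\dot p_i = \partial L/\partial q_i$, it follows that:

$$dH = \sum_i\left(\dot q_i\,dp_i - \dot p_i\,dq_i\right) - \frac{\partial L}{\partial t}dt. \tag{3.12}$$

It is useful to note here that by dividing the above equation by $dt$, it follows that $dH/dt = -\partial L/\partial t$. By direct comparison with Eq. (3.10), we can now immediately write down Hamilton's canonical equations:

$$\dot q_i = \frac{\partial H}{\partial p_i}, \qquad \dot p_i = -\frac{\partial H}{\partial q_i}, \qquad \frac{\partial H}{\partial t} = -\frac{\partial L}{\partial t}.$$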

We may then summarize the procedure used in the Hamilton formalism to solve a given problem: 

• Construct $L = L(q, \dot q, t)$.

• Define the canonical momenta $p_i = \partial L/\partial\dot q_i$.

• Construct the Hamilton function $H = \sum_i p_i\dot q_i - L$.

• Use $p_i = \partial L/\partial\dot q_i$ to express $\dot q_i$ as a function of $(q, p, t)$.

• Eliminate $\dot q_i$ from $H$ such that $H = H(q, p, t)$.

• You can now use H to solve the canonical equations of motion. 
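
The steps above can be sketched for a simple pendulum, which is used here purely as an assumed illustration. With $L = \frac{1}{2}ml^2\dot\theta^2 + mgl\cos\theta$ one finds $p = ml^2\dot\theta$ and $H = p^2/(2ml^2) - mgl\cos\theta$, so the canonical equations are $\dot\theta = p/(ml^2)$ and $\dot p = -mgl\sin\theta$. The code integrates them with a classical Runge-Kutta step and monitors $H$. All parameter values are arbitrary.

```python
import math

m, l, g = 1.0, 1.0, 9.81      # illustrative pendulum parameters (assumptions)

def hamiltonian(theta, p):
    """H(theta, p) = p^2 / (2 m l^2) - m g l cos(theta)."""
    return p * p / (2.0 * m * l * l) - m * g * l * math.cos(theta)

def canonical_rhs(theta, p):
    """Hamilton's equations: theta_dot = dH/dp, p_dot = -dH/dtheta."""
    return p / (m * l * l), -m * g * l * math.sin(theta)

def rk4_step(theta, p, dt):
    """One classical Runge-Kutta step for the pair (theta, p)."""
    k1 = canonical_rhs(theta, p)
    k2 = canonical_rhs(theta + dt / 2 * k1[0], p + dt / 2 * k1[1])
    k3 = canonical_rhs(theta + dt / 2 * k2[0], p + dt / 2 * k2[1])
    k4 = canonical_rhs(theta + dt * k3[0], p + dt * k3[1])
    return (theta + dt / 6 * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]),
            p + dt / 6 * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]))

def energy_drift(theta0=1.0, p0=0.0, dt=1e-3, steps=5000):
    """Largest deviation of H from its initial value along the trajectory."""
    theta, p = theta0, p0
    h0 = hamiltonian(theta, p)
    worst = 0.0
    for _ in range(steps):
        theta, p = rk4_step(theta, p, dt)
        worst = max(worst, abs(hamiltonian(theta, p) - h0))
    return worst
```

Since this $H$ has no explicit time dependence, it should stay constant along the trajectory, so `energy_drift()` measures only the integration error.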

Let’s have a look at a practical example of this. 

Example 10. Hamilton formalism for particle in EM field. We know from previous considerations in this book that for this scenario we have $L = T - U$ with $U = q\phi - q\mathbf{A}\cdot\mathbf{v}$. The potentials $\phi$ and $\mathbf{A}$ may depend on $\mathbf{r}$ and $t$. The Lagrange equations are satisfied with this $U$:

$$\frac{d}{dt}\frac{\partial L}{\partial\dot x_i} - \frac{\partial L}{\partial x_i} = 0. \tag{3.13}$$

Using Cartesian coordinates and the sum convention, we may write

$$L = \frac{1}{2}m\dot x_i\dot x_i + qA_i\dot x_i - q\phi. \tag{3.14}$$

The canonical momentum is then

$$p_i = \frac{\partial L}{\partial\dot x_i} = m\dot x_i + qA_i. \tag{3.15}$$

From this, we obtain the Hamilton function

$$H = p_i\dot x_i - L = (m\dot x_i + qA_i)\dot x_i - \frac{1}{2}m\dot x_i\dot x_i - qA_i\dot x_i + q\phi = \frac{1}{2}m\dot x_i\dot x_i + q\phi. \tag{3.16}$$

This is simply mechanical (kinetic) energy plus potential energy. We now get rid of $\dot x_i$ via

$$\dot x_i = \frac{1}{m}(p_i - qA_i) \tag{3.17}$$

and upon insertion of this into $H$ we end up with

$$H = \frac{1}{2m}(\mathbf{p} - q\mathbf{A})^2 + q\phi \tag{3.18}$$

where the dependence on $x_i$ and $t$ is in $\mathbf{A}$ and $\phi$. Thus, if $\mathbf{A}$ and $\phi$ are independent of $t$ we have $\partial L/\partial t = 0$ and thus the Hamilton function is conserved:

$$dH/dt = \partial H/\partial t = -\partial L/\partial t = 0. \tag{3.19}$$
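
The algebra leading from (3.16) to (3.18) can be double-checked numerically. The sketch below is an illustration with arbitrary sample values. It evaluates $H = p_i\dot x_i - L$ directly from the Lagrangian (3.14) and compares it with the closed form (3.18), evaluated at the momentum (3.15). The function names are hypothetical.

```python
def lagrangian(v, A, phi, m, q):
    """L = m v.v / 2 + q A.v - q phi, Eq. (3.14), at fixed position and time."""
    return (0.5 * m * sum(vi * vi for vi in v)
            + q * sum(Ai * vi for Ai, vi in zip(A, v))
            - q * phi)

def hamiltonian_from_legendre(v, A, phi, m, q):
    """Return (H, p) with p_i = m v_i + q A_i and H = p.v - L."""
    p = [m * vi + q * Ai for vi, Ai in zip(v, A)]
    H = sum(pi * vi for pi, vi in zip(p, v)) - lagrangian(v, A, phi, m, q)
    return H, p

def hamiltonian_closed_form(p, A, phi, m, q):
    """H = (p - q A)^2 / (2 m) + q phi, Eq. (3.18)."""
    return sum((pi - q * Ai) ** 2 for pi, Ai in zip(p, A)) / (2.0 * m) + q * phi

# Arbitrary sample values for the velocity, potentials, mass and charge.
v, A, phi, m, q = (0.3, -1.2, 0.7), (0.5, 0.1, -0.4), 2.0, 1.3, -0.8
H_legendre, p = hamiltonian_from_legendre(v, A, phi, m, q)
H_closed = hamiltonian_closed_form(p, A, phi, m, q)
```

Both routes should give the same number, which equals $\frac{1}{2}m\mathbf{v}^2 + q\phi$ for the chosen values.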

IV. THE TWO-BODY PROBLEM: CENTRAL FORCES

Youtube-videos. 13-20 in this playlist. 

Learning goals. After reading this chapter, the student should: 

• Be able to explain and derive how a two-body problem with central force interactions can be reduced to an effective 
one-body problem. 

• Understand how different particle trajectories arise depending on the energy E of the particle and classify these trajectories 
accordingly 

• Have detailed knowledge on how to treat a particle moving in a Kepler potential 

• Understand the concept of a differential scattering cross section and be able to compute it for simple potential profiles $V(r)$


A. Reduction to equivalent one-body problem 

A powerful result in treating a two-body system where the forces are central (i.e. acting only along the line connecting the two 
bodies) is that the system may be reduced to an effective one-body problem. We will now derive exactly how this equivalency is 
obtained. 


With two classical bodies, there are in general 6 degrees of freedom (3 d.o.f. for each body associated with movement in the three spatial dimensions). This means we need 6 generalized coordinates to describe the total system. We may choose for instance the center of mass (CoM) coordinate $\mathbf{R}$ and the relative coordinate $\mathbf{r}$:

$$\mathbf{R} = \frac{m_1\mathbf{r}_1 + m_2\mathbf{r}_2}{m_1 + m_2}, \qquad \mathbf{r} = \mathbf{r}_2 - \mathbf{r}_1. \tag{4.1}$$

If the force resulting from the interaction between the particles is central, we have a potential

$$V = V(r), \quad r = |\mathbf{r}|. \tag{4.2}$$

The Lagrange function is then

$$L = T(\dot{\mathbf{R}}, \dot{\mathbf{r}}) - V(r). \tag{4.3}$$

Let us consider the kinetic energy in more detail. We have

$$T = \frac{1}{2}m_1\dot{\mathbf{r}}_1^2 + \frac{1}{2}m_2\dot{\mathbf{r}}_2^2 \tag{4.4}$$

which after straightforward algebraic manipulation may be re-expressed in terms of $\mathbf{R}$ and $\mathbf{r}$ as follows

$$T = \frac{1}{2}(m_1 + m_2)\dot{\mathbf{R}}^2 + \frac{1}{2}\frac{m_1m_2}{m_1 + m_2}\dot{\mathbf{r}}^2. \tag{4.5}$$

At this point, it is convenient to introduce the reduced mass $\mu = m_1m_2/(m_1 + m_2)$ and the total mass $M = m_1 + m_2$. We then have

$$L = \frac{1}{2}M\dot{\mathbf{R}}^2 + \frac{1}{2}\mu\dot{\mathbf{r}}^2 - V(r). \tag{4.6}$$


The crucial observation at this point is now that R is a cyclic coordinate: L is independent on R. It follows that the belonging 
canonical momentum pr must be a constant: 


U 1j 

Pr = —- = MR = constant. (4.7) 

dR 

• 2 

Thus, R is a constant and we may simply drop the term MR since adding or subtracting a constant to a Lagrange function has 
no physical consequence: it simply redefines the zero-energy level of the potential energy. With this simplification, we have now 
in fact reduced the initial two body problem to an equivalent one body problem since L now only depends on r and r: 

L = T — V = ^ fir 2 — V (r) (4.8) 

In other words, the physics of the system corresponds to a particle with mass /x moving with a velocity r (where r is the relative 
coordinate between the original two bodies) in a potential V (r). 
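The rewriting of the kinetic energy in terms of the centre-of-mass and relative velocities can be checked numerically. The following is a minimal sketch, not part of the original notes; the masses and velocities are arbitrary test values chosen purely for illustration.

```python
def kinetic_direct(m1, m2, v1, v2):
    """Kinetic energy from the individual velocities, as in Eq. (4.4)."""
    return 0.5 * m1 * sum(c * c for c in v1) + 0.5 * m2 * sum(c * c for c in v2)


def kinetic_com_relative(m1, m2, v1, v2):
    """Kinetic energy from CoM and relative velocities, as in Eq. (4.5)."""
    total_mass = m1 + m2
    reduced_mass = m1 * m2 / total_mass
    # Time derivatives of the CoM coordinate R and relative coordinate r of Eq. (4.1).
    v_com = [(m1 * a + m2 * b) / total_mass for a, b in zip(v1, v2)]
    v_rel = [b - a for a, b in zip(v1, v2)]
    return (0.5 * total_mass * sum(c * c for c in v_com)
            + 0.5 * reduced_mass * sum(c * c for c in v_rel))


# Arbitrary illustrative values (assumptions, not taken from the text).
m1, m2 = 2.0, 3.0
v1, v2 = (1.0, -2.0, 0.5), (0.3, 0.7, -1.1)
difference = kinetic_direct(m1, m2, v1, v2) - kinetic_com_relative(m1, m2, v1, v2)
```

The two expressions should agree to rounding error for any choice of masses and velocities, since the cross terms cancel identically.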


B. Equations of motion 

Since we will consider the one-body Lagrange function, let us simply rename $\mu \to m$. We are then looking at a mass $m$ in a central force field. The system is rotationally symmetric as the force only depends on the distance $r$, and it follows that the angular momentum must be conserved:

$$ \mathbf{L} = \mathbf{r} \times \mathbf{p}. \tag{4.9} $$

We remind the reader that whenever a continuous symmetry is present, meaning that $L$ is invariant under some continuous operation such as an arbitrary rotation, there must be a corresponding conservation law. Both the magnitude and the direction of $\mathbf{L}$ are conserved, which can only be fulfilled if $\mathbf{r}$ always lies in a plane perpendicular to $\mathbf{L}$. Central motion thus always occurs in a plane. For this reason, we only need the polar coordinates $r$ and $\theta$ to fully describe the problem. We then have

$$ L = T - V = \frac{1}{2}m(\dot{r}^2 + r^2\dot{\theta}^2) - V(r) = L(r, \dot{r}, \dot{\theta}). \tag{4.10} $$

Since $\theta$ is cyclic, $p_\theta = mr^2\dot{\theta}$ must be conserved. In fact, $mr^2\dot{\theta} = l$ is the magnitude of the angular momentum. One of the equations of motion is then $\dot{p}_\theta = \frac{d}{dt}(mr^2\dot{\theta}) = 0$. We can now prove Kepler’s 2nd law, which states that for a mass moving in a central force field, the radius vector of the mass sweeps over equal areas in equal time intervals.



It follows from the figure that $dA = \frac{1}{2}r \cdot r\,d\theta$. In effect, $\dot{A} = \frac{1}{2}r^2\dot{\theta} = p_\theta/(2m)$, which is a constant. The second equation of motion comes from Lagrange’s equation for the coordinate $r$ and reads:

$$ \frac{d}{dt}(m\dot{r}) - mr\dot{\theta}^2 + \frac{\partial V}{\partial r} = 0. \tag{4.11} $$

The force in the $r$-direction is $f(r) = -\partial V/\partial r$. We then have

$$ m\ddot{r} - mr\dot{\theta}^2 = f(r). \tag{4.12} $$

One can now eliminate $\dot{\theta}$ via $mr^2\dot{\theta} = l$, and thus

$$ m\ddot{r} - \frac{l^2}{mr^3} = f(r). \tag{4.13} $$

This is a 2nd order differential equation in one single variable: $r$. For a conservative system, we know that not only $l$ but also the energy $E$ is a conserved quantity:

$$ E = \frac{1}{2}m(\dot{r}^2 + r^2\dot{\theta}^2) + V(r). \tag{4.14} $$
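The two conservation laws can be illustrated numerically. The sketch below is an illustration only: it assumes the attractive potential $V(r) = -k/r$ with $k = m = 1$, works in Cartesian coordinates in the plane of motion, and uses the velocity-Verlet integrator, which is a choice made here and not something prescribed by the text.

```python
import math


def acceleration(x, y, k=1.0, m=1.0):
    """Acceleration from the attractive central force f(r) = -k/r^2."""
    r3 = (x * x + y * y) ** 1.5
    return -k * x / (m * r3), -k * y / (m * r3)


def integrate(x, y, vx, vy, dt, steps, k=1.0, m=1.0):
    """Advance the planar orbit with the velocity-Verlet scheme."""
    ax, ay = acceleration(x, y, k, m)
    for _ in range(steps):
        vx += 0.5 * dt * ax          # half kick
        vy += 0.5 * dt * ay
        x += dt * vx                 # drift
        y += dt * vy
        ax, ay = acceleration(x, y, k, m)
        vx += 0.5 * dt * ax          # second half kick
        vy += 0.5 * dt * ay
    return x, y, vx, vy


def angular_momentum(x, y, vx, vy, m=1.0):
    """l = m (x v_y - y v_x), which equals m r^2 dtheta/dt."""
    return m * (x * vy - y * vx)


def energy(x, y, vx, vy, k=1.0, m=1.0):
    """E = T + V as in Eq. (4.14), written in Cartesian coordinates."""
    return 0.5 * m * (vx * vx + vy * vy) - k / math.hypot(x, y)


# Illustrative initial condition (an assumption): a bound orbit with E < 0.
start = (1.0, 0.0, 0.0, 1.2)
end = integrate(*start, dt=1e-3, steps=5000)
```

Comparing `angular_momentum` and `energy` at `start` and `end` shows $l$ conserved to rounding error and $E$ conserved up to the small discretisation error of the integrator.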


C. Equivalent one-dimensional problem 

So far, we have not specified the exact form of $V(r)$. For an arbitrary $V(r)$, the differential equation cannot be solved in general. However, it is possible to gain physical insight into the behavior of the particle $m$ even in this case by utilizing an analogy with a one-dimensional problem. To do so, note that Eq. (4.13) may be rewritten as

$$ m\ddot{r} = f'(r) = f(r) + \frac{l^2}{mr^3}. \tag{4.15} $$

This has the same form as Newton’s 2nd law for a one-dimensional problem where a mass $m$ is affected by a force $f'(r)$. The extra term in $f'(r)$ besides the external force $f(r)$ is the centrifugal force. The corresponding effective potential $V'(r)$ may be written $V'(r) = V(r) + l^2/(2mr^2)$. In order to see which consequence this extra term has, we may consider a specific example with $V(r) = -k/r$. It is instructive to plot the effective potential $V'(r)$ as shown below.

We see that $\lim_{r\to 0} V'(r) = +\infty$ and that $V'(r) \to 0$ from below as $r \to \infty$. The trajectory of a particle moving in this potential will depend on its energy $E$. We may distinguish between four particular scenarios.




• $E > 0$: the particle cannot access radii smaller than $r_1$ if its energy is $E = E_1$, while there is no upper bound on $r$. It thus has a turning point at $r = r_1$.

• $E = 0$: qualitatively the same type of motion as for $E > 0$.

• $E < 0$: the motion is now completely bounded: the particle may only have positions between the radii $r_1$ and $r_2$, i.e. $r_1 < r < r_2$.

• $E = E_\text{min}$: the particle can only occupy one particular radius $r = r_0$, which means its trajectory must be a circle since $\dot{r} = 0$. This scenario takes place when the force from the external potential exactly matches the centrifugal force.
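The turning points in the scenarios above are the radii where $E = V'(r)$. For the example $V(r) = -k/r$ this is a quadratic equation in $r$, which the following sketch solves. The function names and the default values $m = k = 1$ are assumptions made for illustration.

```python
import math


def effective_potential(r, l, m=1.0, k=1.0):
    """V'(r) = -k/r + l^2 / (2 m r^2) for the example potential V = -k/r."""
    return -k / r + l * l / (2.0 * m * r * r)


def turning_points(E, l, m=1.0, k=1.0):
    """Positive radii where E = V'(r), i.e. roots of E r^2 + k r - l^2/(2m) = 0."""
    disc = k * k + 2.0 * E * l * l / m
    if disc < 0:
        return []                          # E lies below the minimum of V'
    if E == 0:
        return [l * l / (2.0 * m * k)]     # the equation becomes linear
    roots = [(-k + sign * math.sqrt(disc)) / (2.0 * E) for sign in (1.0, -1.0)]
    return sorted(r for r in roots if r > 0)


bound = turning_points(-0.28, 1.2)     # E < 0: two turning points r_1 < r_2
unbound = turning_points(0.5, 1.2)     # E > 0: a single turning point r_1
```

For $E < 0$ the function returns the pair $r_1, r_2$ bounding the motion, while for $E \geq 0$ only the inner turning point survives.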
D. The virial theorem

We here briefly discuss a useful result known as the virial theorem, which for instance finds its use when discussing periodic motion (such as planetary motion). It is a general theorem valid for many different physical systems due to its statistical nature. Assume that the system under consideration has masses $m_i$ at positions $\mathbf{r}_i$ and that the masses are affected by forces $\mathbf{F}_i$, which include any forces resulting from constraints. We then have $\dot{\mathbf{p}}_i = \mathbf{F}_i$, and by defining the quantity

$$ G = \sum_i \mathbf{p}_i \cdot \mathbf{r}_i \tag{4.16} $$

it follows that

$$ \frac{dG}{dt} = \sum_i \dot{\mathbf{r}}_i \cdot \mathbf{p}_i + \sum_i \dot{\mathbf{p}}_i \cdot \mathbf{r}_i. \tag{4.17} $$

Since $\sum_i \dot{\mathbf{r}}_i \cdot \mathbf{p}_i = \sum_i m_i\dot{\mathbf{r}}_i^2 = 2T$, we may rewrite this as

$$ \frac{dG}{dt} = 2T + \sum_i \mathbf{F}_i \cdot \mathbf{r}_i. \tag{4.18} $$

Now, let us average this equation over a time interval $\tau$, leading to (an overline denoting time-averaging):

$$ \frac{1}{\tau}\left[G(\tau) - G(0)\right] = \overline{2T} + \overline{\sum_i \mathbf{F}_i \cdot \mathbf{r}_i}. \tag{4.19} $$

If the motion is periodic and $\tau$ is its period, we see that the left hand side is 0. If the motion is not periodic while $\mathbf{r}_i$ and $\mathbf{p}_i$ are always finite (which is reasonable), then the limit $\tau \to \infty$ of the left hand side also gives zero. Thus, in both cases we obtain

$$ \overline{T} = -\frac{1}{2}\overline{\sum_i \mathbf{F}_i \cdot \mathbf{r}_i}. \tag{4.20} $$

This is the virial theorem, which thus provides a relation between the average kinetic energy of the system and the forces acting on its constituents. Let’s look at some examples.


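Example 11. Ideal gas. Consider an ideal gas with volume $V$ and $N$ atoms. According to the equipartition principle from statistical mechanics, we know that $\overline{T} = 3NkT/2$, where $T$ on the right hand side denotes the temperature. This is obtained by each particle providing an average kinetic energy of $kT/2$ for each degree of freedom in its motion. Let $\mathbf{F}_i$ be the force exerted on the atoms by the wall, so that $d\mathbf{F}_i = -p\,\mathbf{n}\,dA$, where $p$ is the pressure of the gas and $\mathbf{n}$ is the outward unit normal of the wall. Since we assume an ideal gas, interactions between the atoms themselves are neglected. We then obtain:

$$ \frac{1}{2}\overline{\sum_i \mathbf{F}_i \cdot \mathbf{r}_i} = -\frac{1}{2}p\int \mathbf{n} \cdot \mathbf{r}\,dA = -\frac{1}{2}p\int (\nabla \cdot \mathbf{r})\,dV = -\frac{3}{2}pV. \tag{4.21} $$

Inserting this into the virial theorem, Eq. (4.20), gives $\frac{3}{2}NkT = \frac{3}{2}pV$, i.e. the ideal gas law $pV = NkT$.

Example 12. Particle in a central force field. For conservative forces, we have $\mathbf{F}_i = -\nabla_i V$. Thus, $\overline{T} = \frac{1}{2}\overline{\sum_i \nabla_i V \cdot \mathbf{r}_i}$.

Assume we have one single particle moving in a central field:

$$ \overline{T} = \frac{1}{2}\overline{\frac{\partial V}{\partial r}r}. \tag{4.22} $$

If the potential is given generally by a power-law dependence on the radius, i.e. $V = ar^{n+1}$ (so that the force itself goes like $r^n$), we obtain $\overline{T} = \frac{n+1}{2}\overline{V}$. For the special case of a harmonic oscillator ($n = 1$), we get $\overline{T} = \overline{V}$. In effect, the average kinetic energy is equal to the average potential energy.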
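As a further worked instance of the power-law result, one may take the inverse-square force, $n = -2$, which is the Kepler potential $V = -k/r$ treated in the next section. The short calculation below is a sketch added for illustration and assumes a bound, periodic orbit so that the virial theorem applies:

```latex
\overline{T} = \frac{n+1}{2}\,\overline{V}\,\Big|_{n=-2} = -\frac{1}{2}\,\overline{V},
\qquad
E = \overline{T} + \overline{V} = \frac{1}{2}\,\overline{V} = -\overline{T}.
```

Under this assumption the total energy of a bound orbit is negative, with $|E|$ equal to the average kinetic energy.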


E. The Kepler problem 

We will now consider in great detail a potential of particular importance, relevant e.g. for planetary motion in our solar system. We take the force to be $f(r) = -k/r^2$, so that the potential is $V(r) = -k/r$. This is the so-called Kepler potential. The task is to determine the trajectory that a particle moving in this potential will take, i.e. to find $r = r(\theta)$.

We have that, generally, $\dot{r} = \sqrt{(2/m)(E - V - l^2/2mr^2)}$. By using $\dot{r} = dr/dt$ and combining this equation with $\dot{\theta} = l/mr^2$ (where $\dot{\theta} = d\theta/dt$), we obtain

$$ d\theta = \frac{l\,dr}{mr^2\sqrt{\frac{2}{m}\left(E - V - \frac{l^2}{2mr^2}\right)}}. \tag{4.23} $$

This can be integrated:

$$ \theta - \theta_0 = \int \frac{dr}{r^2\sqrt{\frac{2mE}{l^2} - \frac{2mV}{l^2} - \frac{1}{r^2}}}. \tag{4.24} $$


So far, the treatment is valid for a general potential $V(r)$. Now, we insert our Kepler potential $V(r) = -k/r$ and use the substitution $u = 1/r$ (so that $du = -dr/r^2$):

$$ \theta - \theta_0 = -\int \frac{du}{\sqrt{\frac{2mE}{l^2} + \frac{2mk}{l^2}u - u^2}}. \tag{4.25} $$

The integration constant $\theta_0$ is determined by the initial conditions. For now, let us set $\theta_0 = 0$. We will see from the final solution that this choice means that we are measuring the polar angle $\theta$ relative to the perihelion of the trajectory. This is defined as the point where the particle is closest to the center of the potential ($r = 0$). The above integral over $u$ can be solved analytically by using the following formula:

$$ \int \frac{dx}{\sqrt{\alpha + \beta x + \gamma x^2}} = \frac{1}{\sqrt{-\gamma}}\arccos\left[-\frac{\beta + 2\gamma x}{\sqrt{q}}\right] \quad \text{with } q = \beta^2 - 4\alpha\gamma. \tag{4.26} $$

Using this formula allows us to solve the integral in Eq. (4.25) by identifying $\alpha = 2mE/l^2$, $\beta = 2mk/l^2$ and $\gamma = -1$. We obtain

$$ \theta = -\arccos\left[\frac{l^2/(mkr) - 1}{\sqrt{1 + 2El^2/mk^2}}\right]. \tag{4.27} $$


We can tidy up this expression quite a bit by introducing some useful parameters. First, we define the eccentricity $\epsilon$ of the orbit via

$$ \epsilon = \sqrt{1 + \frac{2El^2}{mk^2}}. \tag{4.28} $$

Next, we introduce the so-called orbit parameter $p = l^2/mk$. In this way, we may rewrite the above to:

$$ \theta = -\arccos\left[(p/r - 1)/\epsilon\right]. \tag{4.29} $$

Straightforward algebraic manipulation of this expression yields the final answer for $r = r(\theta)$, describing the trajectory of a particle moving in a Kepler potential:

$$ r = \frac{p}{1 + \epsilon\cos\theta}. \tag{4.30} $$

As announced earlier, we see that $\theta = 0$ indeed corresponds to $r = r_\text{min} = p/(1 + \epsilon)$, which is the perihelion of the orbit.


The question is now: what does the trajectory described by Eq. (4.30) really look like? We may distinguish between four 
scenarios, in complete analogy with how we distinguished between particle trajectories depending on the energy E of the 
particle in a previous section. We find that 

• $\epsilon > 1$ (which corresponds to $E > 0$): hyperbola.

• $\epsilon = 1$ ($E = 0$): parabola.

• $\epsilon < 1$ ($E < 0$): ellipse.

• $\epsilon = 0$ ($E = -mk^2/2l^2$): circle.
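The classification above can be turned into a small helper. This is a sketch with hypothetical function names and default values $m = k = 1$; the tolerance `tol` used to decide when the eccentricity counts as exactly 0 or 1 is an arbitrary choice made here.

```python
import math


def eccentricity(E, l, m=1.0, k=1.0):
    """Eccentricity from Eq. (4.28); clipped at zero to absorb rounding errors."""
    return math.sqrt(max(0.0, 1.0 + 2.0 * E * l * l / (m * k * k)))


def orbit_type(E, l, m=1.0, k=1.0, tol=1e-12):
    """Name of the conic section traced out in the Kepler potential."""
    eps = eccentricity(E, l, m, k)
    if eps < tol:
        return "circle"
    if abs(eps - 1.0) < tol:
        return "parabola"
    return "ellipse" if eps < 1.0 else "hyperbola"


def radius(theta, E, l, m=1.0, k=1.0):
    """r(theta) from Eq. (4.30), with theta measured from the perihelion."""
    p = l * l / (m * k)
    return p / (1.0 + eccentricity(E, l, m, k) * math.cos(theta))
```

For example, `orbit_type(-0.28, 1.2)` classifies a bound orbit, and `radius(0.0, -0.28, 1.2)` gives its perihelion distance $p/(1 + \epsilon)$.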

Let’s have a closer look at the case of an elliptical trajectory.

The semi-major axis is denoted $a$ while the semi-minor axis is denoted $b$. The motion is bound since $r$ lies between $r_1$ and $r_2$. It follows from the figure that:

$$ 2a = r_1 + r_2 = \frac{p}{1 + \epsilon} + \frac{p}{1 - \epsilon} = \frac{2p}{1 - \epsilon^2}. \tag{4.31} $$

Inserting the expressions for $p$ and the eccentricity $\epsilon$ that we identified earlier, one arrives at

$$ a = \frac{k}{2|E|}. \tag{4.32} $$

The absolute value sign comes from the fact that $E$ is negative. Similarly, one may identify

$$ b = \frac{l}{\sqrt{2m|E|}} \tag{4.33} $$

by using that the eccentricity $\epsilon$ satisfies $\epsilon = c/a$, where $2c$ is the distance between the focal points, and that $a^2 = b^2 + c^2$ for an ellipse.


With these considerations, we are now in a position to establish a useful relation between the period of the orbit $T$ and the size of the elliptical trajectory. We have previously shown that the "areal velocity" is constant:

$$ \frac{dA}{dt} = \frac{l}{2m} \quad \Rightarrow \quad A = \frac{lT}{2m} \tag{4.34} $$

by integrating over one period $T$. For an ellipse, we have $A = \pi ab$. Inserting the expressions we derived above for $a$ and $b$, we end up with

$$ T = 2\pi a^{3/2}\sqrt{m/k}. \tag{4.35} $$

This is Kepler’s third law, namely that $T^2 \propto a^3$.
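The relations (4.32), (4.33) and (4.35) can be checked against each other numerically. The sketch below uses a hypothetical helper `ellipse_parameters` with default values $m = k = 1$; the energies and angular momenta passed to it are illustrative assumptions.

```python
import math


def ellipse_parameters(E, l, m=1.0, k=1.0):
    """Semi-axes a, b and period T of a bound Kepler orbit (requires E < 0)."""
    if E >= 0:
        raise ValueError("a closed elliptical orbit requires E < 0")
    a = k / (2.0 * abs(E))                                   # Eq. (4.32)
    b = l / math.sqrt(2.0 * m * abs(E))                      # Eq. (4.33)
    period = 2.0 * math.pi * a ** 1.5 * math.sqrt(m / k)     # Eq. (4.35)
    return a, b, period


# Consistency check: the area pi*a*b must equal l*T/(2m), cf. Eq. (4.34).
l_test = 1.2
a, b, T = ellipse_parameters(-0.28, l_test)
area_mismatch = math.pi * a * b - l_test * T / 2.0

# Kepler's third law: a four times larger semi-major axis gives an eight times longer period.
period_ratio = ellipse_parameters(-0.125, 1.0)[2] / ellipse_parameters(-0.5, 1.0)[2]
```

Here `area_mismatch` should vanish to rounding error and `period_ratio` should equal $4^{3/2} = 8$.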


F. Scattering cross section 

In order to define what the scattering cross section is and what it gives us information about, consider a scenario where we have 
a uniform flux of particles heading toward the center of some potential V(r). These ’’particles” could in reality be anything: 
electrons, $\alpha$-particles, planets, so the situation at hand is quite general. Assume for simplicity that all particles have the same
mass and energy. The potential $V(r)$ is such that the resulting force $f(r) = -dV/dr \to 0$ when $r \to \infty$. We may characterize
the incident flux of particles with the intensity $I$:

I = the number of particles passing through a unit of cross sectional area normal to the current flow per unit of time. (4.36) 

The trajectory of the particles will be deflected from rectilinear motion when they get close to the center of the potential $V(r)$, since it acts with a force on them. After the particles have passed through the potential, the force acting on them subsides and their trajectories eventually return to rectilinear motion.

Based on the above scenario, we now define the differential scattering cross section $\sigma(\Omega)$:

$$ \sigma(\Omega)\,d\Omega = \frac{\text{number of particles scattered into solid angle } d\Omega \text{ per unit time}}{\text{intensity of the incident flux of particles}}. \tag{4.37} $$

Recall that for a solid angle, we have $d\Omega = \sin\theta\,d\theta\,d\phi$. The unit of $\sigma(\Omega)$ thus becomes m$^2$. If we are dealing with central forces where the potential only depends on the absolute value of the distance from the potential center, i.e. $V(\mathbf{r}) = V(r)$, there exists a symmetry around the axis that defines the direction of incidence. In effect, we can integrate over $d\phi$ and consider $d\Omega = 2\pi\sin\theta\,d\theta$, where $\theta$ is the scattering angle.



Let us consider a specific example where $V(r)$ is repulsive. It is convenient to introduce the impact parameter $s$:

$$ l = |\mathbf{r} \times \mathbf{p}|_{r\to\infty} = mv_0 s = s\sqrt{2mE}. \tag{4.38} $$

For a fixed energy $E$ and $s$, the scattering angle $\theta$ will be uniquely determined. Assume then that different values of $s$ give rise to different scattering angles $\theta$ for the incident particles. We may then state that the number of particles incident between $s$ and $s + ds$ must equal the number of particles scattered between $\theta$ and $\theta + d\theta$. Expressed mathematically, this gives:

$$ 2\pi I s|ds| = 2\pi\sigma(\theta) I \sin\theta\,|d\theta|. \tag{4.39} $$

We have inserted absolute value signs due to the fact that one often has $ds/d\theta < 0$. This can be understood by noting that decreasing $s$, i.e. causing the particles to pass closer to the potential center, is likely to increase the scattering angle $\theta$ since they feel a stronger force. We thus end up with

$$ \sigma(\theta) = \frac{s}{\sin\theta}\left|\frac{ds}{d\theta}\right|. \tag{4.40} $$


In order to find $\sigma(\theta)$, we have to find out how $s$ and $\theta$ depend on each other in order to compute $ds/d\theta$. We can do this by going back to our previous relation between the angle of the trajectory and the radius of the trajectory that we used in our treatment of the Kepler potential. Since we already used $\theta$ for the scattering angle, let us use $\beta$ for the angle denoting a specific point on the trajectory. We have

$$ \beta - \beta_0 = \int_{r_0}^{r} \frac{dr}{r^2\sqrt{\frac{2mE}{l^2} - \frac{2mV}{l^2} - \frac{1}{r^2}}}. \tag{4.41} $$

From the figure, we see that $\theta + 2\psi = \pi$. Now, $r_0 = \infty$ corresponds to $\beta_0 = \pi$. Also, $r = r_m$ (the minimum radius of the trajectory, the so-called perihelion) corresponds to $\beta = \pi - \psi$. Using these relations in the general expression Eq. (4.41), we obtain

$$ (\pi - \psi) - \pi = \int_{\infty}^{r_m} \frac{dr}{r^2\sqrt{\frac{2mE}{l^2} - \frac{2mV}{l^2} - \frac{1}{r^2}}}. \tag{4.42} $$

Rewriting this in terms of $s = l/\sqrt{2mE}$, we get:

$$ \psi = \int_{r_m}^{\infty} \frac{s\,dr}{r^2\sqrt{1 - V/E - s^2/r^2}}. \tag{4.43} $$


Substituting $u = 1/r$, we finally obtain the desired result which allows us to evaluate $ds/d\theta$:

$$ \theta(s) = \pi - 2\int_0^{u_m} \frac{s\,du}{\sqrt{1 - V/E - s^2u^2}}. \tag{4.44} $$

Note that so far, we have not specified the precise form of the potential $V$, and hence our result is generic. If $V$ is a complicated function of $r$, then the above equation can only be evaluated numerically. Note that $u_m = 1/r_m$ is a known quantity: it is given by

$$ 1 - V(u_m)/E - s^2u_m^2 = 0. \tag{4.45} $$

This may be understood by noting that

$$ \frac{dr}{d\beta} = \frac{r^2\sqrt{2mE}}{l}\sqrt{1 - \frac{V}{E} - \frac{s^2}{r^2}} \tag{4.46} $$

and in the perihelion, we have $dr/d\beta = 0$ by definition. Let’s have a look at an example where the trajectory can be computed analytically, so that we obtain an explicit expression for $s(\theta)$ and finally $\sigma(\theta)$.
Example 13. Repulsive scattering between charged particles in the Coulomb-field. Consider positively charged particles with masses $m$ and $M$ and charges $Ze$ and $Z'e$, respectively.

If $M \gg m$, the center of mass will essentially coincide with the location of $M$. We thus assume that particle $M$ is at rest in our reference frame (the lab system). The Coulomb force is $f(r) = ZZ'e^2/(4\pi\varepsilon_0 r^2)$ and the Coulomb potential is $V(r) = ZZ'e^2/(4\pi\varepsilon_0 r)$. This is precisely the situation considered in the Kepler problem if we define $k = -ZZ'e^2/(4\pi\varepsilon_0)$. We have

$$ E = T + V = \frac{1}{2}mv^2 + V(r) > 0. \tag{4.47} $$

Due to conservation of energy, we also have $E = \frac{1}{2}mv_0^2$, since when the particles are far apart the only energy in the problem is the kinetic energy of particle $m$. Based on our previous analysis, we can immediately state that the resulting trajectory of particle $m$ will be a hyperbola with eccentricity

$$ \epsilon = \sqrt{1 + \frac{2El^2}{m}\left(\frac{4\pi\varepsilon_0}{ZZ'e^2}\right)^2} > 1. \tag{4.48} $$

Note that since $l^2 = 2s^2mE$, this may also be written as $\epsilon = \sqrt{1 + (2Es)^2\left(4\pi\varepsilon_0/ZZ'e^2\right)^2}$. We can also see from our previous analysis of the Kepler problem that, generally, we have:

$$ p/r = 1 + \epsilon\cos(\beta - \beta_0). \tag{4.49} $$

For the case of an elliptical trajectory, we saw that choosing $\beta_0 = 0$ corresponded to measuring $\beta$ from the perihelion. Also, we had defined $p = l^2/mk$. In our case, we have $k < 0$ and hence $p < 0$. This has the consequence that we have to choose $\beta_0 = \pi$ in order for $\beta = 0$ to be the perihelion. We obtain

$$ |p|/r = \epsilon\cos\beta - 1 \tag{4.50} $$

or alternatively:

$$ r = \frac{|p|}{\epsilon\cos\beta - 1}. \tag{4.51} $$



The asymptotes of the trajectory are obtained for $r \to \infty$:

$$ \beta \to \pm\psi, \tag{4.52} $$

which means that $\cos\psi = 1/\epsilon$, where $\epsilon$ is the eccentricity. The scattering angle is $\theta = \pi - 2\psi$. This means that

$$ \cos(\pi/2 - \theta/2) = \sin(\theta/2) = \frac{1}{\epsilon}. \tag{4.53} $$

We can rewrite this expression as

$$ \cot(\theta/2) = 4\pi\varepsilon_0\frac{2Es}{ZZ'e^2} \tag{4.54} $$

and isolate $s$ from this equation:

$$ s = s(\theta, E) = \frac{1}{4\pi\varepsilon_0}\frac{ZZ'e^2}{2E}\cot(\theta/2). \tag{4.55} $$

Differentiating, we obtain

$$ \frac{ds}{d\theta} = -\frac{1}{4\pi\varepsilon_0}\frac{ZZ'e^2}{4E\sin^2(\theta/2)}. \tag{4.56} $$

We may then finally write down the result for the differential scattering cross section analytically:

$$ \sigma(\theta) = \frac{s}{\sin\theta}\left|\frac{ds}{d\theta}\right| = \left(\frac{1}{4\pi\varepsilon_0}\right)^2\left(\frac{ZZ'e^2}{4E}\right)^2\frac{1}{\sin^4(\theta/2)}. \tag{4.57} $$

This is known as Rutherford’s scattering cross section. Interestingly, a fully quantum mechanical calculation (albeit non-relativistic) would yield exactly the same answer.

Once we have computed the differential scattering cross section, we can obtain the total scattering cross section $\sigma$ as follows:

$$ \sigma = \int \sigma(\Omega)\,d\Omega = 2\pi\int \sigma(\theta)\sin\theta\,d\theta. \tag{4.58} $$

The total cross section $\sigma$ can be thought of as the net area that particles can be scattered into. With the Coulomb potential, one finds that $\sigma \to \infty$. The physical interpretation of this is that the Coulomb force has an infinite range, and so regardless of how large the impact parameter $s$ is, the particle will be scattered and contribute to $\sigma$. Quantum mechanically, one would find that if $V \to 0$ faster than $1/r^2$ when $r \to \infty$, $\sigma$ will be finite.

It is also worth emphasizing that we assumed $M \gg m$ so that the center of mass was at rest. Rutherford’s formula is always valid in the CM-frame if one interprets $\theta_\text{CM}$ as the angle between the incident and outgoing particle. In the lab-system, the angle between incident and outgoing particle will in general be different from $\theta_\text{CM}$.
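The Rutherford result can be cross-checked numerically by inserting the impact parameter of Eq. (4.55) into the general formula (4.40) and comparing with Eq. (4.57). In the sketch below, `kappa` is a shorthand introduced here for $ZZ'e^2/(4\pi\varepsilon_0)$, and the derivative $ds/d\theta$ is approximated by a central finite difference with a step `h` chosen for illustration.

```python
import math


def impact_parameter(theta, E, kappa=1.0):
    """s(theta, E) from Eq. (4.55); kappa stands for Z Z' e^2 / (4 pi eps0)."""
    return kappa / (2.0 * E) / math.tan(theta / 2.0)


def rutherford(theta, E, kappa=1.0):
    """Closed-form differential cross section, Eq. (4.57)."""
    return (kappa / (4.0 * E)) ** 2 / math.sin(theta / 2.0) ** 4


def cross_section_from_s(theta, E, kappa=1.0, h=1e-6):
    """sigma = (s / sin(theta)) |ds/dtheta|, Eq. (4.40), with a numerical derivative."""
    ds = (impact_parameter(theta + h, E, kappa)
          - impact_parameter(theta - h, E, kappa)) / (2.0 * h)
    return impact_parameter(theta, E, kappa) / math.sin(theta) * abs(ds)


# Illustrative values (assumptions): scattering angle 1 rad, energy 2 in units where kappa = 1.
relative_error = abs(cross_section_from_s(1.0, 2.0) / rutherford(1.0, 2.0) - 1.0)
```

The two expressions should agree up to the small error of the finite-difference derivative. The same `cross_section_from_s` pattern could be reused for other potentials once $s(\theta)$ is known, for instance from a numerical evaluation of Eq. (4.44).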
V. KINEMATICS AND EQUATIONS OF MOTION FOR RIGID BODIES

Youtube-videos. 21-28 in this playlist. 


Learning goals. After reading this chapter, the student should: 

• Know about the transformation matrix 

• Know how the Euler angles can be used to determine the spatial orientation of the axes in a rigid body.

• Be familiar with infinitesimal transformations. 

• Be familiar with the Coriolis force. 


We will in this chapter consider rigid bodies, subject to the holonomic constraints that the distance between two arbitrary 
material points is always the same. The rigid body is in a strict sense an idealized model (it cannot exist in quantum 
mechanics, not even at zero absolute temperature, because of the zero-point oscillations). But it is a very useful model 
nevertheless. We will here deal with the kinematics of rigid bodies, i.e., how to specify the nature and the characteristics of 
their motion. The dynamics of rigid bodies is something different and will be considered later in this chapter: it concerns the 
motions as determined by the action of extraneous forces. That is, the equations of motion will then have to be taken into account. 

How many degrees of freedom does a rigid body have, i.e., how many coordinates are needed to specify its position? Neglecting all constraints to begin with, a body consisting of $N$ particles has $3N$ degrees of freedom in all. This number is strongly reduced because of the constraints saying that the distance between particles $i$ and $j$ is fixed, $r_{ij} = \text{constant}$. The number of these constraints is $1 + 2 + \dots + (N - 1) = \frac{1}{2}N(N - 1)$, but these are not all independent. Actually, we need only determine the positions of three specified points (not lying along the same line), plus the corresponding constraints $r_{ij} = \text{constant}$. The three points are linked by three constraints of this type, so that the number of degrees of freedom is reduced from 9 to 6. With reference to the figure, consider the following simple reasoning:

• We need 3 coordinates to specify the location of one point, say point 1;

• Then we need two coordinates to specify point 2 (this point can lie on a spherical surface centered at point 1, with radius $r_{12}$);

• We finally need one more coordinate to specify point 3 (this point can lie on a circle around the axis between 1 and 2).

This adds up to 3+2+1=6 degrees of freedom in all.


A. Orthogonal transformations and independent coordinates 

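Let now $x, y, z$ be the axes in a fixed “external” (lab) coordinate system and let $x', y', z'$ be the corresponding axes in a coordinate system which is fixed in the rigid body. In addition to the three coordinates needed to specify the origin of the $(x', y', z')$ system relative to the $(x, y, z)$ system, we need the directions of $x', y', z'$ relative to $x, y, z$. It is convenient to use the direction cosines $\alpha_1, \alpha_2, \alpha_3$ of the primed axes relative to the unprimed. They are defined via

$$ \begin{aligned} \alpha_1 &= \cos(\mathbf{i}', \mathbf{i}) = \mathbf{i}' \cdot \mathbf{i} \\ \alpha_2 &= \cos(\mathbf{i}', \mathbf{j}) = \mathbf{i}' \cdot \mathbf{j} \\ \alpha_3 &= \cos(\mathbf{i}', \mathbf{k}) = \mathbf{i}' \cdot \mathbf{k} \end{aligned} $$

and analogously with $\beta$ for $\mathbf{j}'$ and $\gamma$ for $\mathbf{k}'$. Since

$$ \mathbf{i}' = (\mathbf{i}' \cdot \mathbf{i})\mathbf{i} + (\mathbf{i}' \cdot \mathbf{j})\mathbf{j} + (\mathbf{i}' \cdot \mathbf{k})\mathbf{k}, $$

we have then

$$ \begin{aligned} \mathbf{i}' &= \alpha_1\mathbf{i} + \alpha_2\mathbf{j} + \alpha_3\mathbf{k} \\ \mathbf{j}' &= \beta_1\mathbf{i} + \beta_2\mathbf{j} + \beta_3\mathbf{k} \\ \mathbf{k}' &= \gamma_1\mathbf{i} + \gamma_2\mathbf{j} + \gamma_3\mathbf{k} \end{aligned} \tag{5.1} $$

We can of course invert this process, that is, express $\mathbf{i}$, $\mathbf{j}$ and $\mathbf{k}$ in terms of their components along $\mathbf{i}'$, $\mathbf{j}'$ and $\mathbf{k}'$:

$$ \mathbf{i} = (\mathbf{i} \cdot \mathbf{i}')\mathbf{i}' + (\mathbf{i} \cdot \mathbf{j}')\mathbf{j}' + (\mathbf{i} \cdot \mathbf{k}')\mathbf{k}' = \alpha_1\mathbf{i}' + \beta_1\mathbf{j}' + \gamma_1\mathbf{k}' $$

and so on.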


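The direction cosines give the connection between arbitrary vectors in the two systems $(x, y, z)$ and $(x', y', z')$ (we assume the same origins in the two systems). For example, a position vector $\mathbf{r}$ will have an $x'$-component given by

$$ x' = \mathbf{r} \cdot \mathbf{i}' = (x\mathbf{i} + y\mathbf{j} + z\mathbf{k}) \cdot \mathbf{i}' = \alpha_1 x + \alpha_2 y + \alpha_3 z, $$

and an arbitrary vector $\mathbf{G}$ will have a $y'$-component

$$ G_{y'} = \mathbf{G} \cdot \mathbf{j}' = (G_x\mathbf{i} + G_y\mathbf{j} + G_z\mathbf{k}) \cdot \mathbf{j}' = \beta_1 G_x + \beta_2 G_y + \beta_3 G_z. $$

We have 9 direction cosines, but have seen that we need only 3 coordinates to determine the body’s orientation uniquely. The reduction can be done via the orthogonality conditions:

$$ \begin{aligned} \mathbf{i} \cdot \mathbf{i} &= (\alpha_1\mathbf{i}' + \beta_1\mathbf{j}' + \gamma_1\mathbf{k}')^2 = \alpha_1^2 + \beta_1^2 + \gamma_1^2 = 1 \\ \mathbf{i} \cdot \mathbf{j} &= (\alpha_1\mathbf{i}' + \beta_1\mathbf{j}' + \gamma_1\mathbf{k}') \cdot (\alpha_2\mathbf{i}' + \beta_2\mathbf{j}' + \gamma_2\mathbf{k}') = \alpha_1\alpha_2 + \beta_1\beta_2 + \gamma_1\gamma_2 = 0, \end{aligned} $$

etc. In compact form:

$$ \alpha_l\alpha_m + \beta_l\beta_m + \gamma_l\gamma_m = \delta_{lm}. \tag{5.2} $$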


We can thus not use the direction cosines as generalized coordinates, for instance in a Lagrangian formulation, as they are not independent. (Later, we shall see that there are three independent functions of the direction cosines, called the Euler angles, which can be used for this purpose.) The direction cosines are nevertheless useful, as they describe the relationship between Cartesian coordinate systems.

B. Transformation matrix and its mathematical properties

We introduce now a more convenient notation by letting $x, y, z \to x_1, x_2, x_3$. The transformations then become

$$ \begin{aligned} x'_1 &= \alpha_1 x_1 + \alpha_2 x_2 + \alpha_3 x_3 \\ x'_2 &= \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_3 \\ x'_3 &= \gamma_1 x_1 + \gamma_2 x_2 + \gamma_3 x_3 \end{aligned} $$

This is a linear transformation which in general can be written

$$ \begin{aligned} x'_1 &= a_{11}x_1 + a_{12}x_2 + a_{13}x_3 \\ x'_2 &= a_{21}x_1 + a_{22}x_2 + a_{23}x_3 \\ x'_3 &= a_{31}x_1 + a_{32}x_2 + a_{33}x_3, \end{aligned} $$

where $a_{ij}$ are constant coefficients (i.e., independent of $x, x'$). Introducing the summation convention, implying that repeated indices are to be summed over, we can express this as

$$ x'_i = a_{ij}x_j, \qquad i \in \{1, 2, 3\}. \tag{5.3} $$

The length of the vector $\mathbf{r}$ cannot be influenced by the transformation as it corresponds to a spatial rotation; thus

$$ x'_i x'_i = x_i x_i \quad \Rightarrow \quad a_{ij}a_{ik}x_j x_k = x_i x_i, $$

which gives

$$ a_{ij}a_{ik} = \delta_{jk}. \tag{5.4} $$

By inserting $\alpha, \beta, \gamma$, we see that this is just the earlier 6 conditions from equation (5.2). We define the rotation matrix as

$$ A = \begin{bmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{bmatrix} $$

with matrix elements $a_{ij}$.


Example 14. Two dimensions. In two dimensions the transformation matrix is

$$ A = \begin{bmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{bmatrix} $$

and the orthogonality conditions are

$$ a_{ij}a_{ik} = \delta_{jk}. $$

With four matrix elements and three orthogonality conditions we are left with one independent variable, which is naturally interpreted as the rotation angle $\phi$. From the geometry we have

$$ \begin{aligned} x'_1 &= x_1\cos\phi + x_2\sin\phi \\ x'_2 &= -x_1\sin\phi + x_2\cos\phi \end{aligned} $$

which means

$$ a_{11} = \cos\phi, \quad a_{12} = \sin\phi, \quad a_{21} = -\sin\phi, \quad a_{22} = \cos\phi. $$

Then

$$ A = \begin{bmatrix} \cos\phi & \sin\phi \\ -\sin\phi & \cos\phi \end{bmatrix}. $$

We may check the orthogonality:

$$ \begin{aligned} a_{11}a_{11} + a_{21}a_{21} = 1 &: \quad \cos^2\phi + \sin^2\phi = 1 \\ a_{12}a_{12} + a_{22}a_{22} = 1 &: \quad \sin^2\phi + \cos^2\phi = 1 \\ a_{11}a_{12} + a_{21}a_{22} = 0 &: \quad \cos\phi\sin\phi - \sin\phi\cos\phi = 0 \end{aligned} $$

which is seen to be satisfied.
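The orthogonality check of the two-dimensional example can also be carried out numerically for any angle. The following sketch uses hypothetical helper names and plain nested lists as matrices; the test angle is arbitrary.

```python
import math


def rotation_2d(phi):
    """Transformation matrix of the two-dimensional example for rotation angle phi."""
    c, s = math.cos(phi), math.sin(phi)
    return [[c, s], [-s, c]]


def orthogonality_residual(a):
    """Largest deviation of the sums a_ij a_ik (summed over i) from delta_jk."""
    n = len(a)
    worst = 0.0
    for j in range(n):
        for k in range(n):
            total = sum(a[i][j] * a[i][k] for i in range(n))
            expected = 1.0 if j == k else 0.0
            worst = max(worst, abs(total - expected))
    return worst


residual = orthogonality_residual(rotation_2d(0.7))   # arbitrary test angle
```

The residual should vanish to rounding error for every angle, whereas a generic non-orthogonal matrix gives a residual of order one.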



As a general point we observe that the transformation equation $\mathbf{r}' = A\mathbf{r}$ can be considered in two different ways:

1. Passive transformation: $A$ is looked upon as an operator that rotates the coordinate system (counterclockwise in the example above), while the vector $\mathbf{r}$ itself is unchanged. We thus find the components of $\mathbf{r}$ in the rotated coordinate system. As we will see, this is the usual way of looking at transformations in special relativity, as Lorentz transformations can be pictured as rotations in a four-dimensional spacetime.

2. Active transformation: $A$ is looked upon as an operator that rotates the vector $\mathbf{r}$, while the coordinate system is unchanged. We thus find a new vector $\mathbf{r}'$ in the same coordinate system as before (we must rotate the vector clockwise in order to get the same equations $\mathbf{r}' = A\mathbf{r}$ as above). This interpretation is often used in quantum field theory.


C. Formal properties of the transformation matrix 

Let us look at two successive transformations, first $B$ and then $A$:

$$ \mathbf{x} \xrightarrow{\;B\;} \mathbf{x}' \xrightarrow{\;A\;} \mathbf{x}''. $$

With use of the summation convention we can write this as

$$ x'_k = b_{kj}x_j, \qquad x''_i = a_{ik}x'_k \quad \Rightarrow \quad x''_i = a_{ik}b_{kj}x_j = c_{ij}x_j, \qquad c_{ij} = a_{ik}b_{kj}. $$

We thus see that two orthogonal transformations $A$ and $B$ following each other are equivalent to one transformation $C$ such that $C = AB$. It can be verified that $C$ is also an orthogonal transformation. In general one has

$$ AB \neq BA, $$

so that the transformation is non-commutative. Further, one has

$$ (AB)C = A(BC), $$

showing that the transformation is associative.

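So far, we have dealt with square matrices. We now introduce column matrices:

$$ \mathbf{x} = \begin{bmatrix} x_1 \\ x_2 \\ x_3 \end{bmatrix}, \qquad \mathbf{x}' = \begin{bmatrix} x'_1 \\ x'_2 \\ x'_3 \end{bmatrix}. $$

The matrix $A\mathbf{x}$ thereby becomes a column matrix with elements $(A\mathbf{x})_i = a_{ij}x_j = x'_i = (\mathbf{x}')_i$, that is,

$$ \mathbf{x}' = A\mathbf{x}. $$

Note that we have not done anything else than writing the vector $\mathbf{r}$ as a column matrix $\mathbf{x}$, where the number of elements is the same as the dimensionality of the space under consideration.

The inverse transformation is written as $A^{-1}$, with matrix elements $a^{-1}_{ij}$ [note: $a^{-1}_{ij}$ is the $(i, j)$-element of $A^{-1}$, not “$1/a_{ij}$”!]. The transformation $A^{-1}$ shall bring $\mathbf{x}'$ back to $\mathbf{x}$:

$$ x_i = a^{-1}_{ij}x'_j, $$

thus:

$$ x'_k = a_{ki}x_i = a_{ki}a^{-1}_{ij}x'_j \quad \Rightarrow \quad a_{ki}a^{-1}_{ij} = \delta_{kj} \quad \Rightarrow \quad (AA^{-1})_{kj} = \delta_{kj} \quad \Rightarrow \quad AA^{-1} = I, $$

where $I$ is the unit matrix, whose elements are given by the Kronecker symbol $\delta_{ij}$. In three spatial dimensions it reads

$$ I = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}. $$

From $x_i = a^{-1}_{ij}x'_j = a^{-1}_{ij}a_{jk}x_k$ we get $a^{-1}_{ij}a_{jk} = \delta_{ik}$, i.e., $A^{-1}A = I$, which means

$$ AA^{-1} = A^{-1}A = I. \tag{5.5} $$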


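Now consider the double sum $a_{kl}a_{ki}a^{-1}_{lj}$. By using the orthogonality conditions $a_{kl}a_{ki} = \delta_{li}$ we see that this becomes equal to $a^{-1}_{ij}$. Alternatively, we can exploit that $a_{kl}a^{-1}_{lj} = \delta_{kj}$ to see that the double sum can also be written as $a_{ji}$. Accordingly, $a^{-1}_{ij} = a_{ji}$. But $a_{ji} = \tilde{a}_{ij}$, the $(i, j)$-element of the transposed matrix $\tilde{A}$. This means that

$$ A^{-1} = \tilde{A}. \tag{5.6} $$

Again using the summation convention, we write $\tilde{A}A = I$ as $\tilde{a}_{ij}a_{jk} = \delta_{ik}$ and $A\tilde{A} = I$ as $a_{ij}\tilde{a}_{jk} = \delta_{ik}$, so that

$$ \begin{aligned} a_{ji}a_{jk} &= \delta_{ik} \qquad \text{(sum over first index)} \\ a_{ij}a_{kj} &= \delta_{ik} \qquad \text{(sum over second index)} \end{aligned} \tag{5.7} $$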


Finally, we consider the determinant $|A|$ of the (assumed square) matrix $A$ (the symbol $|\,|$ means the determinant, not the modulus!). From courses in mathematics we know that

$$ |AB| = |A| \cdot |B|. $$

Since $\tilde{A}A = I$, one has $|\tilde{A}| \cdot |A| = 1$, and as the value of the determinant does not depend on the interchange of rows and columns, we get $|\tilde{A}| = |A|$, and therewith

$$ |A|^2 = 1 \quad \text{for all orthogonal matrices.} \tag{5.8} $$

This implies that $|A| = \pm 1$.


D. Euler angles 

We found above that the 9 elements $a_{ij}$ cannot serve as independent coordinates since a rotating body with one point fixed can have only 3 degrees of freedom. By means of 6 orthogonality conditions we managed to reduce the number of coordinates to the right number of 3. In addition we always have one extra condition, namely that the transformation shall be physically possible. This implies that the rotation matrix goes continuously over to the unit matrix, corresponding to no rotation at all, when the rotation angle goes to zero. Mathematically, this means that $|A| = |I| = +1$. We cannot have $|A| = -1$ if the transformation shall be physically realizable.

As a simple example we may consider the matrix

$$ S = \begin{bmatrix} -1 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & 0 & -1 \end{bmatrix} $$

which implies a reflection of the coordinate axes: $\mathbf{x}' = S\mathbf{x} \Rightarrow x' = -x,\ y' = -y,\ z' = -z$. This transformation is excluded, as is reasonable also from the fact that $S$ changes a right-handed coordinate system into a left-handed one.

We have to find 3 independent coordinates necessary to fix the orientation of the rigid body, in such a way that the orthogonal transformation matrix $A$ satisfies $|A| = +1$. The most common choice is to introduce the so-called Euler angles. The transformation consists of three successive rotations involving three angles, called $\phi, \theta, \psi$, in a specific order:



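1. $xyz \to \xi\eta\zeta$ by a rotation through an angle $\phi$ in the positive (counterclockwise) direction around the $z$-axis. Thus $\mathbf{x}'' = D\mathbf{x}$, with

$$ \mathbf{x}'' = \begin{bmatrix} \xi \\ \eta \\ \zeta \end{bmatrix}, \qquad \mathbf{x} = \begin{bmatrix} x \\ y \\ z \end{bmatrix}. $$

2. $\xi\eta\zeta \to \xi'\eta'\zeta'$ by a rotation through an angle $\theta$ in the positive (counterclockwise) direction about the $\xi$-axis. The $\xi$-axis is called the line of nodes. Thus $\mathbf{x}''' = C\mathbf{x}''$, with $\mathbf{x}''' = [\xi', \eta', \zeta']^T$.

3. $\xi'\eta'\zeta' \to x'y'z'$ by a rotation through an angle $\psi$ in the positive (counterclockwise) direction about the $\zeta'$-axis. Thus $\mathbf{x}' = B\mathbf{x}'''$.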


Let us consider in more detail $D$, the first of the three transformation matrices. It describes a rotation about the $z$-axis:

$$ D = \begin{bmatrix} \cos\phi & \sin\phi & 0 \\ -\sin\phi & \cos\phi & 0 \\ 0 & 0 & 1 \end{bmatrix}. \tag{5.9} $$

The reason for the number '1' in the lower right corner is that it specifies which axis the rotation takes place about. In this way we manage to describe an initial two-dimensional rotation in a three-dimensional space.

The second matrix describes the rotation about the $\xi$-axis (the line of nodes):

$$ C = \begin{bmatrix} 1 & 0 & 0 \\ 0 & \cos\theta & \sin\theta \\ 0 & -\sin\theta & \cos\theta \end{bmatrix}, \tag{5.10} $$

where the number '1' in the upper left corner specifies that in this case the rotation axis is the line of nodes. Finally, the matrix $B$ describes the rotation about the $\zeta'$-axis:

$$ B = \begin{bmatrix} \cos\psi & \sin\psi & 0 \\ -\sin\psi & \cos\psi & 0 \\ 0 & 0 & 1 \end{bmatrix}, \tag{5.11} $$

where evidently the '1' to the lower right refers to the $\zeta'$-axis.

The composite transformation $A$ is built up from these elementary matrices,

$$ A = BCD, \tag{5.12} $$

where the equation is to be read from right to left. Multiplying out the matrices we get the general form

$$ A = \begin{bmatrix} \cos\psi\cos\phi - \cos\theta\sin\phi\sin\psi & \cos\psi\sin\phi + \cos\theta\cos\phi\sin\psi & \sin\psi\sin\theta \\ -\sin\psi\cos\phi - \cos\theta\sin\phi\cos\psi & -\sin\psi\sin\phi + \cos\theta\cos\phi\cos\psi & \cos\psi\sin\theta \\ \sin\theta\sin\phi & -\sin\theta\cos\phi & \cos\theta \end{bmatrix}. \tag{5.13} $$

The inverse transformation $\mathbf{x} = A^{-1}\mathbf{x}'$ is given by $A^{-1} = \tilde{A}$, which follows by interchanging rows and columns in $A$.
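The composite Euler matrix can be built numerically as the product $BCD$ and checked for orthogonality and unit determinant. The sketch below is illustrative: the helper names are introduced here, matrices are plain nested lists, and the three angles are arbitrary test values.

```python
import math


def matmul(a, b):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def transpose(a):
    return [[a[j][i] for j in range(3)] for i in range(3)]


def rot_about_third_axis(angle):
    """Form shared by D (angle phi) and B (angle psi)."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]


def rot_about_first_axis(angle):
    """Form of C (angle theta): rotation about the line of nodes."""
    c, s = math.cos(angle), math.sin(angle)
    return [[1.0, 0.0, 0.0], [0.0, c, s], [0.0, -s, c]]


def euler_matrix(phi, theta, psi):
    """A = B C D, read from right to left."""
    d = rot_about_third_axis(phi)
    c = rot_about_first_axis(theta)
    b = rot_about_third_axis(psi)
    return matmul(b, matmul(c, d))


def determinant(a):
    return (a[0][0] * (a[1][1] * a[2][2] - a[1][2] * a[2][1])
            - a[0][1] * (a[1][0] * a[2][2] - a[1][2] * a[2][0])
            + a[0][2] * (a[1][0] * a[2][1] - a[1][1] * a[2][0]))


A = euler_matrix(0.3, 0.5, 0.7)        # arbitrary test angles
identity_check = matmul(A, transpose(A))
```

With these definitions `identity_check` should be the unit matrix and `determinant(A)` should equal $+1$, and individual elements of `A` can be compared with the closed-form entries of the general form.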






E. Infinitesimal transformations 

Two successive transformations can be described as a product of two matrices, $AB$. We know that matrix multiplication is in general non-commutative, i.e. $AB \neq BA$. This is illustrated by the example shown in the figure, where a rectangular box is subjected to the same two rotations, but in different order.



So far, we have considered finite transformations. We shall now see that, in contrast to most finite transformations, infinitesimal transformations are commutative. Consider, in tensor form, the infinitesimal transformation
\[
x_i' = x_i + \epsilon_{ij} x_j = (\delta_{ij} + \epsilon_{ij})\, x_j, \qquad |\epsilon_{ij}| \ll 1.
\]
In matrix form,
\[
\mathbf{x}' = (I + E)\,\mathbf{x},
\]
where the matrix $E$ is composed of the elements $\epsilon_{ij}$. Now consider two successive transformations,
\[
(I + E_1)(I + E_2) = I + E_1 + E_2 + E_1 E_2 \approx I + E_1 + E_2,
\]
since terms of second order in infinitesimal transformations can be neglected ($E$ being an "infinitesimal" transformation means just that the $\epsilon_{ij}$ are so small that $O(\epsilon_{ij}^2)$ terms are negligible). As $I + E_1 + E_2 = I + E_2 + E_1$, we get
\[
(I + E_1)(I + E_2) = (I + E_2)(I + E_1), \tag{5.14}
\]
that is,
\[
\text{Infinitesimal transformations are commutative.} \tag{5.15}
\]

It follows immediately that the inverse of the transformation $A = I + E$ is
\[
A^{-1} = I - E,
\]
because $AA^{-1} = (I + E)(I - E) = I$, where we again neglect terms $O(\epsilon^2)$. We know from before that the transformation matrix is orthogonal, $A^T = A^{-1}$, so that
\[
A^T = I + E^T = A^{-1} = I - E \;\Rightarrow\; E^T = -E, \qquad \epsilon_{ji} = -\epsilon_{ij}, \tag{5.16}
\]

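that is, the matrix $E$ is antisymmetric. A general infinitesimal antisymmetric $3\times 3$ matrix thus has only three independent elements; let us call them $d\Omega_1$, $d\Omega_2$, $d\Omega_3$. It is natural to arrange them cyclically in the matrix $E$ as
\[
E = \begin{pmatrix} 0 & d\Omega_3 & -d\Omega_2 \\ -d\Omega_3 & 0 & d\Omega_1 \\ d\Omega_2 & -d\Omega_1 & 0 \end{pmatrix},
\]
and thus
\[
\mathbf{x}' - \mathbf{x} = d\mathbf{x} = E\mathbf{x} =
\begin{pmatrix} 0 & d\Omega_3 & -d\Omega_2 \\ -d\Omega_3 & 0 & d\Omega_1 \\ d\Omega_2 & -d\Omega_1 & 0 \end{pmatrix}
\begin{pmatrix} x_1 \\ x_2 \\ x_3 \end{pmatrix}.
\]
In component form, with use of the summation convention, this becomes
\[
dx_i = \epsilon_{ijk}\, x_j\, d\Omega_k, \tag{5.17}
\]
where $\epsilon_{ijk}$ is the Levi-Civita symbol (not to be confused with the matrix elements $\epsilon_{ij}$). Alternatively, we can write it as
\[
d\mathbf{r} = \mathbf{r} \times d\boldsymbol{\Omega}. \tag{5.18}
\]

Note that the quantity $d\boldsymbol{\Omega}$ is a differential vector, not the differential of a finite vector: there exists no vector of which $d\boldsymbol{\Omega}$ is the differential, since a finite rotation cannot be represented by a single vector. Let us compare with the equations for rotation found earlier: $d\boldsymbol{\Omega}$ can be interpreted physically as a small vectorial change
\[
d\boldsymbol{\Omega} = \mathbf{n}\, d\Phi,
\]
where $\mathbf{n}$ is the unit vector along the rotation axis and $d\Phi$ is a small angle. Thus
\[
d\mathbf{r} = \mathbf{r} \times \mathbf{n}\, d\Phi.
\]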
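
The two statements of this section can be illustrated numerically. The sketch below is only an illustration under assumed values (the step size `eps`, the vector `r` and the helper names are not from the notes): it shows that the commutator of two infinitesimal rotations is of second order in the small parameter, and that $E\mathbf{x}$ coincides with $\mathbf{r} \times d\boldsymbol{\Omega}$.

```python
def cross(a, b):
    """Cross product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def generator(d_omega):
    """Antisymmetric matrix E built cyclically from (dOmega_1, dOmega_2, dOmega_3)."""
    d1, d2, d3 = d_omega
    return [[0.0, d3, -d2], [-d3, 0.0, d1], [d2, -d1, 0.0]]

def identity_plus(E):
    """Return I + E."""
    return [[(1.0 if i == j else 0.0) + E[i][j] for j in range(3)] for i in range(3)]

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def matvec(X, v):
    return tuple(sum(X[i][j] * v[j] for j in range(3)) for i in range(3))

eps = 1e-4  # assumed small rotation angle
E1 = generator((eps, 0.0, 0.0))
E2 = generator((0.0, eps, 0.0))
P12 = matmul(identity_plus(E1), identity_plus(E2))
P21 = matmul(identity_plus(E2), identity_plus(E1))

# The two orderings differ only by terms of order eps**2.
commutator_size = max(abs(P12[i][j] - P21[i][j]) for i in range(3) for j in range(3))

# dx = E x should agree with dr = r x dOmega.
r = (0.3, -1.2, 2.0)
d_omega = (2e-4, -1e-4, 3e-4)
dr_matrix = matvec(generator(d_omega), r)
dr_cross = cross(r, d_omega)
print(commutator_size, dr_matrix, dr_cross)
```

Halving `eps` should reduce `commutator_size` by roughly a factor of four, which is the numerical signature of a second-order term.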



F. The rate of change of time-dependent vectors 

Let us now use the above results to describe the rate of change of a vector. Consider a rigid body rotating with an instantaneous angular velocity $\boldsymbol{\omega} = d\boldsymbol{\Omega}/dt$ when seen from an 'absolute' coordinate system outside the body. Such a system is also called an inertial system, characterized by the validity of Newton's ordinary equations of motion. We let such a system be designated by a subscript 's'. Let $\mathbf{G}$ be an arbitrary vector ($= \mathbf{r}, \mathbf{v}, \mathbf{L}$, etc.). Because of the rotation of the axes of the body, the rate of change of $\mathbf{G}$ will be perceived differently in the absolute system and in the relative (comoving) system whose axes are fixed in the body. Let quantities referring to the relative system be given a subscript 'r'.

Assume first that $\mathbf{G}$ is fixed in the body, so that $d\mathbf{G}_r = 0$. Then, we can find $d\mathbf{G}_s$ from the formula for pure rotation,
\[
d\mathbf{G}_s = d\boldsymbol{\Omega} \times \mathbf{G}.
\]
In general, $\mathbf{G}_r$ as seen in the body frame may change also. Thus we obtain as a natural generalization of the last equation,
\[
d\mathbf{G}_s = d\mathbf{G}_r + d\boldsymbol{\Omega} \times \mathbf{G}.
\]
The rate of change of this quantity is
\[
\left(\frac{d\mathbf{G}}{dt}\right)_s = \left(\frac{d\mathbf{G}}{dt}\right)_r + \boldsymbol{\omega} \times \mathbf{G}, \tag{5.19}
\]
where $\boldsymbol{\omega} = d\boldsymbol{\Omega}/dt$ is the instantaneous angular velocity. As $\mathbf{G}$ is a general vector we can write this as an operator relation,
\[
\left(\frac{d}{dt}\right)_s = \left(\frac{d}{dt}\right)_r + \boldsymbol{\omega} \times. \tag{5.20}
\]
For example: $\mathbf{G} = \mathbf{r} \Rightarrow \mathbf{v}_s = \mathbf{v}_r + \boldsymbol{\omega} \times \mathbf{r}$.


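G. Components of $\boldsymbol{\omega}$ along the body axes

It is often useful to know the components of the angular velocity $\boldsymbol{\omega}$ along the body axes $x'$, $y'$ and $z'$. The corresponding transformation can be taken as three successive rotations with angular velocities $\omega_\phi = \dot\phi$, $\omega_\theta = \dot\theta$ and $\omega_\psi = \dot\psi$, respectively. We use the theory from above to find the components:

$\boldsymbol{\omega}_\phi$ corresponds to a rotation about the $z$-axis, i.e. $\boldsymbol{\omega}_\phi = (0, 0, \dot\phi)^T$ in the $xyz$ system, and the transformation to the body axes becomes
\[
\boldsymbol{\omega}_\phi' = A\,\boldsymbol{\omega}_\phi.
\]
Inserting the components of $A$ found earlier we get
\[
(\omega_\phi)_{x'} = \dot\phi \sin\theta \sin\psi, \qquad (\omega_\phi)_{y'} = \dot\phi \sin\theta \cos\psi, \qquad (\omega_\phi)_{z'} = \dot\phi \cos\theta.
\]

$\boldsymbol{\omega}_\theta$ corresponds to a rotation about the $\xi$-axis (the line of nodes), i.e. $\boldsymbol{\omega}_\theta = (\dot\theta, 0, 0)^T$ in the $\xi'\eta'\zeta'$ system, and the transformation becomes
\[
\boldsymbol{\omega}_\theta' = B\,\boldsymbol{\omega}_\theta.
\]
Inserting the components of $B$ found earlier we get
\[
(\omega_\theta)_{x'} = \dot\theta \cos\psi, \qquad (\omega_\theta)_{y'} = -\dot\theta \sin\psi, \qquad (\omega_\theta)_{z'} = 0.
\]

As $\boldsymbol{\omega}_\psi$ corresponds to a rotation about the $\zeta'$-axis, and thus about $z'$, no further transformation is necessary. We get simply
\[
\boldsymbol{\omega}_\psi = (0, 0, \dot\psi)^T.
\]

Adding the three contributions, we get finally
\[
\begin{aligned}
\omega_{x'} &= \dot\phi \sin\theta \sin\psi + \dot\theta \cos\psi, \\
\omega_{y'} &= \dot\phi \sin\theta \cos\psi - \dot\theta \sin\psi, \\
\omega_{z'} &= \dot\phi \cos\theta + \dot\psi.
\end{aligned}
\]
In vector form, we thus have
\[
\boldsymbol{\omega} = \omega_{x'}\,\mathbf{i}' + \omega_{y'}\,\mathbf{j}' + \omega_{z'}\,\mathbf{k}'. \tag{5.21}
\]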


H. The Coriolis force 

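We go back to the operator relation $(d/dt)_s = (d/dt)_r + \boldsymbol{\omega} \times$, letting subscript 's' refer to the absolute space as before, and letting subscript 'r' now refer to the earth as a rotating rigid body. We assume that the time dependence of $\boldsymbol{\omega}$ is negligible.

We first apply the operator relation to $\mathbf{r}$, giving
\[
\mathbf{v}_s = \mathbf{v}_r + \boldsymbol{\omega} \times \mathbf{r}, \tag{5.22}
\]
as noted above. We next apply the same operator relation to $\mathbf{v}_s$:
\[
\left(\frac{d\mathbf{v}_s}{dt}\right)_s = \left(\frac{d\mathbf{v}_s}{dt}\right)_r + \boldsymbol{\omega} \times \mathbf{v}_s
= \left[\frac{d}{dt}\left(\mathbf{v}_r + \boldsymbol{\omega} \times \mathbf{r}\right)\right]_r + \boldsymbol{\omega} \times \mathbf{v}_r + \boldsymbol{\omega} \times (\boldsymbol{\omega} \times \mathbf{r}).
\]
As $\left[\frac{d}{dt}(\boldsymbol{\omega} \times \mathbf{r})\right]_r = \boldsymbol{\omega} \times \mathbf{v}_r$ for constant $\boldsymbol{\omega}$, we get
\[
\mathbf{a}_s = \mathbf{a}_r + 2\boldsymbol{\omega} \times \mathbf{v}_r + \boldsymbol{\omega} \times (\boldsymbol{\omega} \times \mathbf{r}). \tag{5.23}
\]
In the absolute space Newton's relations hold: $\mathbf{F} = m\mathbf{a}_s$. Thus we obtain
\[
\mathbf{F}_{\text{eff}} = m\mathbf{a}_r, \qquad
\mathbf{F}_{\text{eff}} = \mathbf{F} + \underbrace{2m\,\mathbf{v}_r \times \boldsymbol{\omega}}_{\text{Coriolis force}} \; \underbrace{-\, m\,\boldsymbol{\omega} \times (\boldsymbol{\omega} \times \mathbf{r})}_{\text{Centrifugal force}}. \tag{5.24}
\]

This expression shows that the Coriolis force gives a deviation to the right on the Northern hemisphere and to the left on the Southern hemisphere. The angular velocity of the earth is $\omega = 7.29 \cdot 10^{-5}\ \text{s}^{-1}$. Letting the axes in the earth and in the absolute space be coincident at a specified instant, we find that $r\omega^2 = 3.38\ \text{cm/s}^2$ is the maximal centripetal acceleration (attained at the equator). The Coriolis force is of importance for the wind systems around the earth. Thus on the Northern hemisphere, this force tends to make the winds circulate in a counterclockwise direction around a low-pressure region.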
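
The orders of magnitude and the direction of the deflection can be reproduced with a few lines of code. This is a rough sketch: the equatorial radius, the speed of 10 m/s and the helper names are assumptions introduced for illustration, while the value of $\omega$ is the one quoted in the text. The frame is earth-fixed with the $z$-axis along the rotation axis, and the Coriolis term per unit mass is taken as $2\mathbf{v}_r \times \boldsymbol{\omega}$.

```python
import math

OMEGA_EARTH = 7.29e-5   # rad/s, value quoted in the text
R_EQUATOR = 6.378e6     # m, assumed equatorial radius (not given in the text)

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def coriolis_acceleration(v_r, omega):
    """Coriolis acceleration 2 v_r x omega, i.e. the Coriolis force per unit mass."""
    return tuple(2.0 * c for c in cross(v_r, omega))

def eastward_deflection(latitude_deg, speed=10.0):
    """East component of the Coriolis acceleration for motion due north.

    The point sits at longitude 0, so 'east' is the y-direction and
    'north' is (-sin(lat), 0, cos(lat)) in the earth-fixed frame.
    """
    lat = math.radians(latitude_deg)
    north = (-math.sin(lat), 0.0, math.cos(lat))
    east = (0.0, 1.0, 0.0)
    v = tuple(speed * n for n in north)
    a = coriolis_acceleration(v, (0.0, 0.0, OMEGA_EARTH))
    return sum(ai * ei for ai, ei in zip(a, east))

# Largest centripetal acceleration r * omega^2, reached at the equator (m/s^2).
centripetal_max = R_EQUATOR * OMEGA_EARTH ** 2

print(centripetal_max * 100.0, "cm/s^2")
print(eastward_deflection(45.0), eastward_deflection(-45.0))
```

For northward motion, east is to the right of the direction of travel, so a positive east component at northern latitudes and a negative one at southern latitudes matches the statement above.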




I. Angular momentum and kinetic energy 

As we have seen, we need 6 independent coordinates to describe a rigid body: a convenient choice is three spatial coordinates to fix the position of the center of mass (CM), plus three additional coordinates (the Euler angles) to specify the orientation of the body axes relative to axes through the CM. This formalism can be looked upon as a preliminary for describing the motion of the body when acted upon by external forces. We may place the origin in whatever point we like, but a natural choice will often be the center of mass. Consider the rotation of a rigid body. Its angular velocity $\boldsymbol{\omega}$ is the same for all points, so that it can be taken as a characteristic property of the body as a whole.

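The total angular momentum about the chosen fixed point reads:
\[
\mathbf{L} = \sum_i m_i\,(\mathbf{r}_i \times \mathbf{v}_i). \tag{5.25}
\]
With a pure rotation we have $\mathbf{v}_i = \boldsymbol{\omega} \times \mathbf{r}_i$, so that
\[
\mathbf{L} = \sum_i m_i\, \mathbf{r}_i \times (\boldsymbol{\omega} \times \mathbf{r}_i) = \sum_i m_i \left[\boldsymbol{\omega}\, r_i^2 - \mathbf{r}_i\,(\mathbf{r}_i \cdot \boldsymbol{\omega})\right]. \tag{5.26}
\]
Consider one component of $\mathbf{L}$, for example $L_x$:
\[
\begin{aligned}
L_x &= \sum_i m_i \left[\omega_x r_i^2 - x_i\,(x_i \omega_x + y_i \omega_y + z_i \omega_z)\right] \\
    &= \sum_i m_i \left[\omega_x (r_i^2 - x_i^2) - \omega_y x_i y_i - \omega_z x_i z_i\right].
\end{aligned}
\]
Evidently we get analogous expressions for $L_y$ and $L_z$, and we see that the components of $\mathbf{L}$ are linearly related to the components of $\boldsymbol{\omega}$. This we can express as (summation convention used)
\[
L_j = I_{jk}\,\omega_k, \tag{5.27}
\]
where $I_{jk}$ is the inertia tensor (or matrix). From the above expression for $L_x$ we conclude that
\[
I_{xx} = \sum_i m_i\,(r_i^2 - x_i^2), \qquad I_{xy} = -\sum_i m_i\, x_i y_i,
\]
and so forth for the other components. With a continuously distributed mass we substitute $m_i \to \rho(\mathbf{r})\,dV$ and $\sum_i \to \int_V$, so that
\[
I_{xx} = \int_V \rho(\mathbf{r})\,(r^2 - x^2)\,dV, \qquad I_{xy} = -\int_V \rho(\mathbf{r})\,xy\,dV.
\]
Letting $x, y, z \to x_1, x_2, x_3$ we obtain the general expression
\[
I_{jk} = \int_V \rho(\mathbf{r})\left(r^2 \delta_{jk} - x_j x_k\right) dV. \tag{5.28}
\]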


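With the usual notation for tensors we get the equation
\[
\mathbf{L} = \mathbf{I} \cdot \boldsymbol{\omega}. \tag{5.29}
\]
The kinetic energy for the motion about the fixed point is
\[
T = \frac{1}{2}\sum_i m_i v_i^2 = \frac{1}{2}\sum_i m_i\, \mathbf{v}_i \cdot \mathbf{v}_i = \frac{1}{2}\sum_i m_i\, \mathbf{v}_i \cdot (\boldsymbol{\omega} \times \mathbf{r}_i)
= \frac{1}{2}\sum_i m_i\, \boldsymbol{\omega} \cdot (\mathbf{r}_i \times \mathbf{v}_i) = \frac{1}{2}\,\boldsymbol{\omega} \cdot \mathbf{L},
\]
where we have used $\mathbf{v}_i = \boldsymbol{\omega} \times \mathbf{r}_i$, as $(\mathbf{v}_i)_r = 0$. Using $\mathbf{L} = \mathbf{I} \cdot \boldsymbol{\omega}$ we get
\[
T = \frac{1}{2}\,\boldsymbol{\omega} \cdot \mathbf{I} \cdot \boldsymbol{\omega},
\]
which can also be written as
\[
T = \frac{1}{2}\,\omega_j I_{jk}\, \omega_k = \frac{1}{2}\, I_{jk}\, \omega_j \omega_k.
\]
Taking $\mathbf{n}$ as the rotation axis (i.e. $\boldsymbol{\omega} = \omega\mathbf{n}$), we have
\[
T = \frac{1}{2}\, n_j \omega\, I_{jk}\, n_k \omega = \frac{1}{2}\,(n_j I_{jk} n_k)\,\omega^2 = \frac{1}{2}\, I \omega^2, \tag{5.30}
\]
where we have defined the moment of inertia about the rotation axis:
\[
I = n_j I_{jk} n_k = \mathbf{n} \cdot \mathbf{I} \cdot \mathbf{n}.
\]
From the expression for the inertia tensor we see that $I_{jk} = I_{kj}$, i.e. $\mathbf{I}$ is a symmetric matrix. Further, all components $I_{jk}$ are real, and thus $\mathbf{I}$ is a Hermitian (self-adjoint) matrix. For a Hermitian matrix $A$ one has
\[
A = A^\dagger = (A^T)^*.
\]
Thus, we can always diagonalize $\mathbf{I}$, i.e. find a coordinate system where the matrix is diagonal:
\[
\mathbf{I} = \begin{pmatrix} I_1 & 0 & 0 \\ 0 & I_2 & 0 \\ 0 & 0 & I_3 \end{pmatrix},
\]
where the elements $I_1$, $I_2$ and $I_3$ are the principal moments of inertia. We will denote the corresponding principal axes by $x_1, x_2, x_3$, with angular momentum components
\[
L_1 = I_1\omega_1, \qquad L_2 = I_2\omega_2, \qquad L_3 = I_3\omega_3,
\]
and kinetic energy
\[
T = \frac{1}{2}\sum_i I_i\,\omega_i^2.
\]
For example, a symmetric spinning top has $I_1 = I_2 \neq I_3$, and a spherically symmetric top has $I_1 = I_2 = I_3 = I \Rightarrow \mathbf{L} = I\boldsymbol{\omega}$.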
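
As a consistency check of (5.26) to (5.30), the inertia tensor of a small set of point masses can be built explicitly. The masses, positions and angular velocity below are arbitrary illustrative values and the function names are assumptions; the sketch compares $\mathbf{I}\cdot\boldsymbol{\omega}$ with the direct sum $\sum_i m_i \mathbf{r}_i \times (\boldsymbol{\omega} \times \mathbf{r}_i)$ and $\tfrac12 \boldsymbol{\omega}\cdot\mathbf{L}$ with $\tfrac12 \sum_i m_i v_i^2$.

```python
def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def inertia_tensor(masses, positions):
    """I_jk = sum_i m_i (r_i^2 delta_jk - x_j x_k) for point masses."""
    I = [[0.0] * 3 for _ in range(3)]
    for m, r in zip(masses, positions):
        r2 = dot(r, r)
        for j in range(3):
            for k in range(3):
                I[j][k] += m * ((r2 if j == k else 0.0) - r[j] * r[k])
    return I

# Arbitrary illustrative rigid body and angular velocity.
masses = [1.0, 2.0, 1.5, 0.5]
positions = [(1.0, 0.0, 0.0), (0.0, 1.0, 1.0), (-1.0, 2.0, 0.5), (0.3, -0.7, 1.2)]
omega = (0.2, -0.5, 1.0)

I = inertia_tensor(masses, positions)

# Angular momentum via the tensor and via the direct sum over particles.
L_tensor = tuple(dot(I[j], omega) for j in range(3))
L_direct = [0.0, 0.0, 0.0]
T_direct = 0.0
for m, r in zip(masses, positions):
    v = cross(omega, r)          # v_i = omega x r_i for a pure rotation
    l = cross(r, v)
    for j in range(3):
        L_direct[j] += m * l[j]
    T_direct += 0.5 * m * dot(v, v)

T_tensor = 0.5 * dot(omega, L_tensor)
print(L_tensor, tuple(L_direct))
print(T_tensor, T_direct)
```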

J. The Euler equations 

Let us derive the equations of motion for a rigid body rotating about a fixed point. From earlier considerations we know that the equation
\[
\left(\frac{d\mathbf{L}}{dt}\right)_s = \mathbf{N}
\]
holds in an inertial system, i.e. without rotation. Here, we use $\mathbf{N}$ for the torque in contrast to the previously used $\boldsymbol{\tau}$. We recall also the operator equation
\[
\left(\frac{d}{dt}\right)_s \mathbf{L} = \left[\left(\frac{d}{dt}\right)_r + \boldsymbol{\omega} \times\right] \mathbf{L},
\]
thus in the rotating system (we drop hereafter the subscripts s and r):
\[
\frac{d\mathbf{L}}{dt} + \boldsymbol{\omega} \times \mathbf{L} = \mathbf{N}, \tag{5.31}
\]
which are the Euler equations in vector form when one point is fixed.

Example 15. Body axes as the principal axes. Assume that the body axes are coincident with the principal axes. Then,
\[
L_1 = I_1\omega_1, \qquad L_2 = I_2\omega_2, \qquad L_3 = I_3\omega_3.
\]
The $i$-component of the Euler equation is
\[
\dot L_i + \epsilon_{ijk}\,\omega_j L_k = N_i.
\]
In component form this means
\[
\begin{aligned}
I_1\dot\omega_1 - \omega_2\omega_3\,(I_2 - I_3) &= N_1, \\
I_2\dot\omega_2 - \omega_3\omega_1\,(I_3 - I_1) &= N_2, \\
I_3\dot\omega_3 - \omega_1\omega_2\,(I_1 - I_2) &= N_3.
\end{aligned}
\]

K. Free rotation of rigid body; precession 

We will consider a rigid body that rotates freely, and describe the precession of its angular velocity. Assume that there is no external torque, $\mathbf{N} = 0$. The Euler equations then give
\[
\begin{aligned}
I_1\dot\omega_1 &= \omega_2\omega_3\,(I_2 - I_3), \\
I_2\dot\omega_2 &= \omega_3\omega_1\,(I_3 - I_1), \\
I_3\dot\omega_3 &= \omega_1\omega_2\,(I_1 - I_2).
\end{aligned}
\]
There are two constants of motion, namely the kinetic energy and the angular momentum. Assume that the body is symmetric, $I_1 = I_2$. Then,
\[
\begin{aligned}
I_1\dot\omega_1 &= \omega_2\omega_3\,(I_1 - I_3), \\
I_1\dot\omega_2 &= -\omega_3\omega_1\,(I_1 - I_3), \\
I_3\dot\omega_3 &= 0.
\end{aligned}
\]
From the last equation we see that $\omega_3$ is a constant, determined by the initial conditions. The two other angular frequency components $\omega_1$ and $\omega_2$ are determined by
\[
\dot\omega_1 = -\Omega\,\omega_2, \qquad \dot\omega_2 = \Omega\,\omega_1,
\]
where $\Omega$ is a new angular frequency defined as
\[
\Omega = \frac{I_3 - I_1}{I_1}\,\omega_3.
\]
We now eliminate $\omega_2$ by combining the above equations,
\[
\ddot\omega_1 + \Omega^2\omega_1 = 0
\;\Rightarrow\; \omega_1(t) = \omega_\perp \cos\Omega t, \qquad \omega_2(t) = \omega_\perp \sin\Omega t.
\]
We see from this that the vector $\omega_1\mathbf{i} + \omega_2\mathbf{j}$ has constant magnitude and rotates (precesses) about the body's $z$-axis with angular frequency $\Omega$. As $\omega_3$ is a constant, $\boldsymbol{\omega}$ has constant length and precesses about the symmetry axis. The precession is relative to the body axes, which themselves rotate with a higher angular frequency $\omega$.

The kinetic energy and the squared angular momentum are
\[
T = \frac{1}{2} I_1(\omega_1^2 + \omega_2^2) + \frac{1}{2} I_3\omega_3^2 = \frac{1}{2} I_1\omega_\perp^2 + \frac{1}{2} I_3\omega_3^2 = \text{const.},
\]
\[
L^2 = I_1^2\omega_\perp^2 + I_3^2\omega_3^2 = \text{const.}
\]
Thus $\omega_\perp$ and $\omega_3$ can be expressed in terms of $T$ and $L$. We see that if $I_1 \approx I_3$, then $\Omega \ll \omega$.
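
The analytic solution for the free symmetric body can be compared with a direct numerical integration of the torque-free Euler equations. The following is a minimal sketch with a hand-written fourth-order Runge-Kutta step; the moments of inertia, the initial angular velocity and the step size are assumed values chosen only for illustration.

```python
import math

def euler_rhs(w, I):
    """Torque-free Euler equations solved for the time derivatives of omega."""
    w1, w2, w3 = w
    I1, I2, I3 = I
    return (w2 * w3 * (I2 - I3) / I1,
            w3 * w1 * (I3 - I1) / I2,
            w1 * w2 * (I1 - I2) / I3)

def rk4_step(w, I, dt):
    """One classical Runge-Kutta step for d(omega)/dt = euler_rhs(omega)."""
    def shifted(base, k, scale):
        return tuple(b + scale * ki for b, ki in zip(base, k))
    k1 = euler_rhs(w, I)
    k2 = euler_rhs(shifted(w, k1, dt / 2), I)
    k3 = euler_rhs(shifted(w, k2, dt / 2), I)
    k4 = euler_rhs(shifted(w, k3, dt), I)
    return tuple(x + dt / 6 * (a + 2 * b + 2 * c + d)
                 for x, a, b, c, d in zip(w, k1, k2, k3, k4))

# Assumed symmetric body (I1 = I2) and initial angular velocity.
I = (1.0, 1.0, 1.5)
w_perp, w3 = 0.3, 2.0
w = (w_perp, 0.0, w3)
Omega = (I[2] - I[0]) / I[0] * w3   # precession frequency from the text

dt, steps = 1e-3, 2000
for _ in range(steps):
    w = rk4_step(w, I, dt)

t = dt * steps
w_exact = (w_perp * math.cos(Omega * t), w_perp * math.sin(Omega * t), w3)
max_error = max(abs(a - b) for a, b in zip(w, w_exact))
print(w, w_exact, max_error)
```

The numerical and analytic components should agree to within the integration error, and $\omega_3$ stays constant as stated above.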


Example 16. Rotation of the earth. As is known, the earth is a little flattened at the poles (it has an oblate form), so that by letting the $z$-axis be the polar axis we can write $I_3 > I_1 = I_2$.

As is moreover known,
\[
\Omega = \frac{I_3 - I_1}{I_1}\,\omega_3 \approx \frac{\omega_3}{300}, \qquad \frac{2\pi}{\omega_3} = 1\ \text{day}.
\]
This leads to a precession period of $\frac{2\pi}{\omega_3} \cdot 300 = 300$ days. The observed period is 440 days and is called the "Chandler wobble". The deviation from the predicted period is attributed to the elastic properties of the earth; the earth is not perfectly rigid.

L. Heavy symmetric top with one point fixed



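We consider a heavy symmetric top spinning about its principal axis in the gravitational field. The gravitational acceleration is $\mathbf{g}$. One point on the symmetry axis is fixed. The coordinate system $x'y'z'$ is now taken to be fixed in space so that $\mathbf{g} = -g\hat{\mathbf{z}}'$, whereas $xyz$ lies fixed in the body such that $z$ is the symmetry axis. The geometrical symmetry implies that
\[
I_1 = I_2 \neq I_3, \qquad (x_1 x_2 x_3) = (xyz).
\]
The kinetic energy is
\[
T = \frac{1}{2} I_j\omega_j^2 = \frac{1}{2} I_1(\omega_1^2 + \omega_2^2) + \frac{1}{2} I_3\omega_3^2.
\]
We make use of the Euler angles $\phi, \theta, \psi$ as (generalized) independent coordinates. From earlier we have
\[
\begin{aligned}
\omega_1 &= \dot\phi \sin\theta \sin\psi + \dot\theta \cos\psi, \\
\omega_2 &= \dot\phi \sin\theta \cos\psi - \dot\theta \sin\psi, \\
\omega_3 &= \dot\phi \cos\theta + \dot\psi.
\end{aligned}
\]
This gives us
\[
\omega_1^2 + \omega_2^2 = \dot\phi^2 \sin^2\theta + \dot\theta^2, \qquad \omega_3^2 = (\dot\phi\cos\theta + \dot\psi)^2
\;\Rightarrow\; T = \frac{I_1}{2}\left(\dot\phi^2\sin^2\theta + \dot\theta^2\right) + \frac{I_3}{2}\left(\dot\phi\cos\theta + \dot\psi\right)^2.
\]
The potential energy is
\[
V = -\sum_i m_i\,\mathbf{r}_i \cdot \mathbf{g} = -M\mathbf{R} \cdot \mathbf{g} = Mgl\cos\theta,
\]
where $M$ is the total mass, $\mathbf{R}$ is the position of the center of mass and $l = |\mathbf{R}|$ is its distance from the fixed point.

We can thus write down the Lagrangian,
\[
L = T - V = \frac{I_1}{2}\left(\dot\phi^2\sin^2\theta + \dot\theta^2\right) + \frac{I_3}{2}\left(\dot\phi\cos\theta + \dot\psi\right)^2 - Mgl\cos\theta.
\]
It is seen that $L$ is independent of $\phi$ and $\psi$; these are accordingly cyclic coordinates, and the corresponding generalized momenta $p_\phi$, $p_\psi$ are constant in time. This can also be seen by inspection: $\mathbf{N} = \mathbf{R} \times M\mathbf{g}$ is directed along the line of nodes, while $z$ and $z'$ both are orthogonal to the line of nodes. Thus $\mathbf{N}$ will have no component along $z$ and $z'$, meaning that there is no change of angular momentum about these axes. The corresponding generalized momenta are
\[
\begin{aligned}
p_\psi &= \frac{\partial L}{\partial\dot\psi} = I_3(\dot\phi\cos\theta + \dot\psi) = I_3\omega_3 = I_1 a = \text{time constant}, \\
p_\phi &= \frac{\partial L}{\partial\dot\phi} = (I_1\sin^2\theta + I_3\cos^2\theta)\,\dot\phi + I_3\dot\psi\cos\theta = I_1 b = \text{time constant}.
\end{aligned}
\]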



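The constants $a$ and $b$ are defined above. In addition, the total energy is constant as the system is conservative:
\[
E = T + V = \frac{I_1}{2}\left(\dot\phi^2\sin^2\theta + \dot\theta^2\right) + \frac{I_3}{2}\left(\dot\phi\cos\theta + \dot\psi\right)^2 + Mgl\cos\theta = \text{const.}
\]
The equations for $p_\psi$ and $p_\phi$ are now solved with respect to $I_3\dot\psi$ and combined. We get
\[
I_1\dot\phi\sin^2\theta + I_1 a\cos\theta = I_1 b,
\]
which gives
\[
\dot\phi = \frac{b - a\cos\theta}{\sin^2\theta}, \qquad
\dot\psi = \frac{I_1 a}{I_3} - \cos\theta\,\frac{b - a\cos\theta}{\sin^2\theta}.
\]
Since $\omega_3$ is constant, the term $\frac{I_3}{2}\omega_3^2$ in $E$ is constant as well. Subtracting it, the remaining conserved energy becomes
\[
E' = E - \frac{I_3}{2}\omega_3^2 = \frac{I_1}{2}\dot\theta^2 + \frac{I_1}{2}\,\frac{(b - a\cos\theta)^2}{\sin^2\theta} + Mgl\cos\theta.
\]
This expression is equivalent to a one-dimensional problem in the variable $\theta$, with an effective potential
\[
V'(\theta) = Mgl\cos\theta + \frac{I_1}{2}\,\frac{(b - a\cos\theta)^2}{\sin^2\theta}.
\]
We substitute $u = \cos\theta$, so that $\dot u = -\dot\theta\sin\theta$, and introduce new constants $\alpha = 2E'/I_1$ and $\beta = 2Mgl/I_1$. The equation for $E'$ can then be rewritten, using $\sin^2\theta = 1 - \cos^2\theta$, to give
\[
\begin{aligned}
E' &= \frac{I_1}{2}\,\frac{\dot u^2}{\sin^2\theta} + \frac{I_1}{2}\,\frac{(b - au)^2}{\sin^2\theta} + Mglu \\
\Rightarrow\; E'(1 - u^2) &= \frac{I_1}{2}\dot u^2 + \frac{I_1}{2}(b - au)^2 + Mglu\,(1 - u^2) \\
\Rightarrow\; \alpha(1 - u^2) &= \dot u^2 + (b - au)^2 + \beta u\,(1 - u^2) \\
\Rightarrow\; \dot u^2 &= (1 - u^2)(\alpha - \beta u) - (b - au)^2 \\
&= \beta u^3 - (\alpha + a^2)\,u^2 + (2ab - \beta)\,u + \alpha - b^2 \equiv f(u).
\end{aligned}
\]
The roots of the right-hand expression give $\dot u = 0$, i.e. $\dot u = -\dot\theta\sin\theta = 0$. This gives the values of the angles for which $\dot\theta$ changes sign.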
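
The turning angles can be located numerically as the roots of $f(u)$ inside $[-1, 1]$. The sketch below is an illustration under assumed dimensionless parameters: the values of $a$ and $\beta$ and the initial condition (top released at $\theta_0 = 60^\circ$ with $\dot\theta = \dot\phi = 0$, which implies $b = a\cos\theta_0$ and $\alpha = \beta\cos\theta_0$) are hypothetical, and the grid-plus-bisection root finder is a simple choice rather than a method taken from the notes.

```python
import math

def make_f(a, b, alpha, beta):
    """Return f(u) = (1 - u^2)(alpha - beta u) - (b - a u)^2."""
    def f(u):
        return (1.0 - u * u) * (alpha - beta * u) - (b - a * u) ** 2
    return f

def bisect(f, lo, hi, iterations=80):
    """Bisection on an interval where f changes sign."""
    f_lo = f(lo)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if f_mid == 0.0:
            return mid
        if f_lo * f_mid < 0.0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)

def turning_points(f, n=997):
    """Roots of f in [-1, 1], found by scanning a grid for sign changes."""
    xs = [-1.0 + 2.0 * k / n for k in range(n + 1)]
    roots = []
    for x0, x1 in zip(xs, xs[1:]):
        f0, f1 = f(x0), f(x1)
        if f0 == 0.0:
            roots.append(x0)
        elif f0 * f1 < 0.0:
            roots.append(bisect(f, x0, x1))
    if f(xs[-1]) == 0.0:
        roots.append(xs[-1])
    return roots

# Hypothetical parameters: spin constant a, gravity constant beta, release angle.
a, beta = 3.0, 2.0
u0 = math.cos(math.radians(60.0))
b, alpha = a * u0, beta * u0   # follows from theta_dot = phi_dot = 0 at release

roots = turning_points(make_f(a, b, alpha, beta))
angles_deg = [math.degrees(math.acos(u)) for u in roots]
print(roots, angles_deg)
```

With these assumed values the cubic has exactly two roots in $[-1, 1]$, one of which is the release value $u_0$; the symmetry axis nutates between the two corresponding angles, while the third root lies above 1 and is unphysical.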



We have
\[
f(\pm 1) = -(b \mp a)^2 \le 0 \quad\text{and}\quad \lim_{u \to \pm\infty} f(u) = \pm\infty,
\]
which means that there exists at least one root $u_3 > 1$, which is nonphysical ($|u| = |\cos\theta| \le 1$ when $\theta$ is real). Physically acceptable values are obtained when $f(u) = \dot u^2 \ge 0$, i.e. for $u$ between the two other roots $u_1$ and $u_2$ (which both have absolute values of at most 1). Thus, $\theta$ can only take values such that $\cos\theta \in [u_1, u_2]$. The motion can be illustrated by the curve which the $z$-axis draws on the surface of a unit sphere with center in the fixed point.

What is the condition for a regular precession to occur? The angle $\theta$ is then a constant, $\theta(t) = \theta(t = 0) = \theta_0$. Moreover, $\theta_1 = \theta_2 = \theta_0$, so that $f(u)$ must have a double root. We get two equations, $f(u_0) = 0$ and $df(u_0)/du = 0$.

The first equation gives
\[
\alpha - \beta u_0 = \frac{(b - au_0)^2}{1 - u_0^2},
\]
and the second, $df(u_0)/du = 0$, gives
\[
\frac{\beta}{2} = \frac{a(b - au_0)}{1 - u_0^2} - u_0\,\frac{\alpha - \beta u_0}{1 - u_0^2}.
\]
Combining the two equations we get
\[
\frac{\beta}{2} = \frac{a(b - au_0)}{1 - u_0^2} - u_0\left(\frac{b - au_0}{1 - u_0^2}\right)^2.
\]
From above we have
\[
\dot\phi = \frac{b - a\cos\theta_0}{\sin^2\theta_0} = \frac{b - au_0}{1 - u_0^2},
\]
so that
\[
\frac{\beta}{2} = a\dot\phi - \dot\phi^2\cos\theta_0.
\]
Inserting $\beta = 2Mgl/I_1$ and $a = I_3\omega_3/I_1 = I_3(\dot\psi + \dot\phi\cos\theta_0)/I_1$ we get
\[
Mgl = \dot\phi\,(I_3\omega_3 - I_1\dot\phi\cos\theta_0)
\;\Rightarrow\;
\dot\phi = \frac{I_3\omega_3 \pm \sqrt{I_3^2\omega_3^2 - 4MglI_1\cos\theta_0}}{2I_1\cos\theta_0}.
\]
We require $\dot\phi$ to be real, so that the radicand has to be non-negative:
\[
I_3^2\omega_3^2 \ge 4MglI_1\cos\theta_0.
\]
From this we can conclude

• for $\theta_0 > \pi/2$, i.e. $\cos\theta_0 < 0$, a regular precession is possible for arbitrary values of $\omega_3$,

• for $\theta_0 < \pi/2$, i.e. $\cos\theta_0 > 0$, a regular precession is possible only if $\omega_3 \ge \frac{2}{I_3}\sqrt{MglI_1\cos\theta_0}$.

The two roots for $\dot\phi$ are associated with "slow" and "fast" precession.

Let us go back to $Mgl = \dot\phi\,(I_3\omega_3 - I_1\dot\phi\cos\theta_0)$. With very slow precession one has $I_1\dot\phi\cos\theta_0 \ll I_3\omega_3$, i.e.
\[
\dot\phi \approx \frac{Mgl}{I_3\omega_3}.
\]
With fast precession the situation is reversed, as $Mgl$ is much smaller than each of the terms on the right-hand side. In this case we get the root
\[
\dot\phi \approx \frac{I_3\omega_3}{I_1\cos\theta_0}.
\]

If $f(u = 1) = 0$, then $\theta = 0$ is a "bouncing angle". We shall consider this case more closely. Assume that $\theta = 0$ at $t = 0$. Then $p_\phi = p_\psi$, and therewith $a = b$ according to our definitions above.

We now obtain for $E'$:
\[
E' = E - \frac{I_3}{2}\omega_3^2 = \underbrace{\frac{I_1}{2}\dot\theta^2}_{=0} + \frac{I_1 a^2}{2}\underbrace{\lim_{\theta \to 0}\frac{(1 - \cos\theta)^2}{\sin^2\theta}}_{=0} + Mgl = Mgl,
\]
so that
\[
\alpha = \frac{2E'}{I_1} = \frac{2Mgl}{I_1} = \beta
\]
and
\[
\dot u^2 = (1 - u^2)\,\beta\,(1 - u) - a^2(1 - u)^2 = (1 - u)^2\left[\beta(1 + u) - a^2\right].
\]
That means, $u = 1$ is a double root and we get $u_3 = a^2/\beta - 1$.

All together:

• $a^2/\beta > 2$ (fast precession) $\Rightarrow u_3 > 1 \Rightarrow$ only possible with $u = 1$, i.e. $\theta = 0$,

• $a^2/\beta < 2 \Rightarrow u_3 < 1 \Rightarrow$ nutation between $\theta = 0$ and $\theta = \theta_3$.

In the limiting case $a^2/\beta = 2$ we get
\[
\frac{a^2}{\beta} = \frac{I_3^2\omega_3^2}{2MglI_1} = 2 \;\Rightarrow\; \omega_3 = \frac{2}{I_3}\sqrt{MglI_1}.
\]
We thus see that the spinning top is not only a simple toy!

VI. SMALL-SCALE, COUPLED OSCILLATIONS

Youtube-videos. 29-32 in this playlist. 

Learning goals. After reading this chapter, the student should: 

• Understand how a coupled system of oscillators can be mathematically analyzed 

• Know how to compute the eigenfrequencies of a system 

• Understand the concept of normal-coordinates and their relation to eigenfrequencies 


A. Coupled oscillators 


As a preliminary to the concepts that we will introduce in this chapter, let us briefly recap some of the main points for a 1D harmonic oscillator. It is described by $V(x) = kx^2/2$, so that the curvature is given by $d^2V/dx^2 = k$. The resulting force is $F = -dV/dx = -kx = m\ddot x$, giving rise to the equation of motion
\[
m\ddot x + kx = 0. \tag{6.1}
\]
The solution is $x(t) = \mathrm{Re}\{Ae^{-i\omega_0 t}\}$ where $\omega_0 = \sqrt{k/m}$. This is a harmonic oscillation with frequency $\omega_0$. We may include friction, corresponding to a damping which eventually terminates the oscillation, by adding a force proportional to the velocity of the particle: $F_f = -\lambda\dot x$. We obtain
\[
m\ddot x + \lambda\dot x + kx = 0. \tag{6.2}
\]
The solution reads $x(t) = e^{-\lambda t/(2m)}\,\mathrm{Re}\{Ae^{-i\tilde\omega t}\}$ with $\tilde\omega = \sqrt{\omega_0^2 - \lambda^2/(4m^2)}$, which for weak damping ($\lambda < 2m\omega_0$) is a damped harmonic oscillation. In the rest of this chapter, we will see how these concepts generalize from one single harmonic oscillator to a set of coupled harmonic oscillators.

Assume that we have a conservative system, i.e. $V$ only depends on position. Any constraints that are present are taken as time-independent. As usual, we may consider a general scenario with $N$ "particles" and thus $3N$ degrees of freedom. With $k$ constraints, the number of degrees of freedom is reduced to $n = 3N - k$. We may then describe the system via the $n$ generalized and independent coordinates $(q_1, q_2, \ldots, q_n)$.

A system in equilibrium is obtained when all generalized forces vanish:
\[
Q_i = -\left(\frac{\partial V}{\partial q_i}\right)_0 = 0. \tag{6.3}
\]
In other words, $V$ has an extremal value in the equilibrium configuration $(q_{0,1}, q_{0,2}, \ldots, q_{0,n})$. A stable equilibrium is characterized by the fact that a small perturbation from the equilibrium configuration only leads to a small bounded motion around this configuration. In contrast, an unstable equilibrium is characterized by a small perturbation from the equilibrium configuration yielding unbounded motion.

We will here be concerned with small deviations from stable equilibrium, thus giving rise to a bounded motion. We write $q_i = q_{0,i} + \eta_i$, where $\eta_i$ represents the deviation from the equilibrium position $q_{0,i}$. Since $\{q_{0,i}\}$ are constants, we may choose $\eta_i$ as new generalized coordinates. It is useful to Taylor-expand $V$ around $q_{0,i}$, which provides:
\[
V(q_1, q_2, \ldots, q_n) = V(q_{0,1}, q_{0,2}, \ldots, q_{0,n}) + \left(\frac{\partial V}{\partial q_i}\right)_0 \eta_i + \frac{1}{2}\left(\frac{\partial^2 V}{\partial q_i \partial q_j}\right)_0 \eta_i\eta_j + \ldots \tag{6.4}
\]
Note that we are using the summation convention with repeated indices. The first order derivative vanishes due to $(\partial V/\partial q_i)_0 = 0$ in equilibrium, as stated above. We also have the freedom of choosing our reference level for energy as we please, so we might as well set $V(q_{0,1}, q_{0,2}, \ldots, q_{0,n}) = 0$. This means that our zero-energy level is the equilibrium configuration. We are left with:
\[
V = \frac{1}{2}\left(\frac{\partial^2 V}{\partial q_i \partial q_j}\right)_0 \eta_i\eta_j \equiv \frac{1}{2} V_{ij}\,\eta_i\eta_j. \tag{6.5}
\]
It follows that we may set $V_{ij} = V_{ji}$.


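The kinetic part of the energy can be written more generally by making it a quadratic function of the velocities:
\[
T = \frac{1}{2} m_{ij}\,\dot q_i\dot q_j = \frac{1}{2} m_{ij}\,\dot\eta_i\dot\eta_j. \tag{6.6}
\]
Note that the coefficients $m_{ij}$ can depend on the coordinates $q_k$:
\[
m_{ij}(q_1, q_2, \ldots, q_n) = m_{ij}(q_{0,1}, q_{0,2}, \ldots, q_{0,n}) + \left(\frac{\partial m_{ij}}{\partial q_k}\right)_0 \eta_k + \ldots \tag{6.7}
\]
For instance, if we write the kinetic energy $T$ in polar coordinates, we know that $T = m\dot r^2/2 + mr^2\dot\theta^2/2$. Comparison with the above allows us to identify $m_{11} = m$, $m_{22} = mr^2$, $m_{12} = m_{21} = 0$. Now, since we are interested in small deviations $\eta_i$ from equilibrium, we only keep terms up to second order in the small quantities $\eta_i$, $\dot\eta_i$ in $T$:
\[
T = \frac{1}{2} T_{ij}\,\dot\eta_i\dot\eta_j \quad\text{where}\quad T_{ij} = m_{ij}(q_{0,1}, q_{0,2}, \ldots). \tag{6.8}
\]
Often, $T_{ij}$ will be diagonal: $T_{ij} = T_i\delta_{ij}$.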

We are now in a position to write down the total Lagrange function:
\[
L(\eta, \dot\eta) = T - V = \frac{1}{2}\left(T_{ij}\,\dot\eta_i\dot\eta_j - V_{ij}\,\eta_i\eta_j\right). \tag{6.9}
\]
Using Lagrange's equations
\[
\frac{d}{dt}\frac{\partial L}{\partial\dot\eta_i} - \frac{\partial L}{\partial\eta_i} = 0, \qquad i = 1, 2, \ldots, n, \tag{6.10}
\]
we obtain the equations of motion determining the time-evolution of the deviations $\eta_i$:
\[
T_{ij}\,\ddot\eta_j + V_{ij}\,\eta_j = 0. \tag{6.11}
\]
This is a set of $n$ coupled second-order differential equations. Their solution $\eta_j(t)$ describes the motion of the system near equilibrium.


We now look for solutions of this equation in the form of oscillations:
\[
\eta_i(t) = A_i e^{-i\omega t}. \tag{6.12}
\]
Here, $A_i$ are complex amplitudes. However, it is implicitly understood that it is $\mathrm{Re}\{\eta_i\}$ that corresponds to the real physical motion. We only work with complex quantities because it is mathematically more convenient than working with cos and sin. Inserting this ansatz into the equation of motion, we get
\[
V_{ij}A_j - \omega^2 T_{ij}A_j = 0. \tag{6.13}
\]
These are now $n$ homogeneous equations for the amplitudes $A_j$. From the theory of linear algebra, we know that such a system has a non-trivial solution for $A_j$ if the following condition is satisfied:
\[
\det(V - \omega^2 T) = 0, \tag{6.14}
\]
where $V$ is the potential energy matrix with entries $V_{ij}$ and similarly for $T$. This equation determines the allowed eigenfrequencies $\omega$ of the system. There may be as many as $n$ different eigenfrequencies; however, there could also be degenerate values for $\omega$. Let us denote the eigenfrequencies $\{\omega_\alpha\}$. Physically, we must demand that the $\omega_\alpha$ are all real numbers. If not, we could write $\omega_\alpha = \omega' + i\omega''$, giving us a solution $\eta_i \propto e^{\omega'' t}$. This would be either exponential increase ($\omega'' > 0$) or damping ($\omega'' < 0$) of the motion, which is not consistent with energy conservation.
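
For two degrees of freedom the condition $\det(V - \omega^2 T) = 0$ is a quadratic equation in $\omega^2$ and can be solved in closed form. The following minimal sketch does this for symmetric $2\times 2$ matrices. The example system (two equal masses $m$ joined to each other and to two walls by three identical springs $k$) is an assumption chosen for illustration and does not appear in the notes; for it, $T = \mathrm{diag}(m, m)$ and $V$ has diagonal entries $2k$ and off-diagonal entries $-k$.

```python
import math

def eigenfrequencies_2dof(V, T):
    """Solve det(V - lam T) = 0 for symmetric 2x2 V and T, with lam = omega^2.

    Assumes a stable equilibrium, so that both roots lam are real and non-negative.
    Returns the two eigenfrequencies omega in ascending order.
    """
    # det = (V11 - lam T11)(V22 - lam T22) - (V12 - lam T12)^2 = a lam^2 + b lam + c
    a = T[0][0] * T[1][1] - T[0][1] ** 2
    b = -(V[0][0] * T[1][1] + V[1][1] * T[0][0] - 2.0 * V[0][1] * T[0][1])
    c = V[0][0] * V[1][1] - V[0][1] ** 2
    root = math.sqrt(b * b - 4.0 * a * c)
    lams = sorted([(-b - root) / (2.0 * a), (-b + root) / (2.0 * a)])
    return [math.sqrt(lam) for lam in lams]

# Assumed example: wall-spring-mass-spring-mass-spring-wall with m = k = 1.
m, k = 1.0, 1.0
T = [[m, 0.0], [0.0, m]]
V = [[2.0 * k, -k], [-k, 2.0 * k]]

omega_low, omega_high = eigenfrequencies_2dof(V, T)
print(omega_low, omega_high)
```

For this example the two values of $\omega^2$ are $k/m$ (masses moving in phase) and $3k/m$ (masses moving in opposite directions).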


By solving Eq. (6.14), one finds $\omega_\alpha$. The next question is then: what do the corresponding amplitudes look like? Which parts of the system are moving, and how much are they moving, when they are oscillating in the mode $\alpha$? To answer this question, we must determine the amplitudes $A_{i\alpha}$: the amplitude of the oscillation along the generalized coordinate $q_i$ in the mode $\alpha$. The amplitude is, according to the above, determined by the equations
\[
(V_{ij} - \omega_\alpha^2 T_{ij})\,A_{j\alpha} = 0. \tag{6.15}
\]
For a given $\omega_\alpha$, these are $n$ equations determining $n - 1$ of the components in the amplitude vector:
\[
\mathbf{A}_\alpha = \begin{pmatrix} A_{1\alpha} \\ A_{2\alpha} \\ \vdots \\ A_{n\alpha} \end{pmatrix}. \tag{6.16}
\]
We are left with one undetermined component for every $\alpha$, for instance $A_{1\alpha}$. Since $A_{1\alpha}$ is in general complex, there are 2 undetermined coefficients (the real and imaginary part or, equivalently, the amplitude and phase of $A_{1\alpha}$) for each $\alpha$. This is the case for each of the $n$ modes, so in total we have $2n$ undetermined quantities. This is in agreement with the fact that we need $2n$ initial conditions to solve Lagrange's equations and determine the solutions completely.

Let us begin by treating the case where all eigenfrequencies are different. Our task is then to show that $A_{i\alpha}$ is proportional to the minor $\Delta_{i\alpha}$ of the determinant $|V - \omega_\alpha^2 T|$. The minor $\Delta_{i\alpha}$ (more precisely the cofactor, i.e. the signed minor) is defined as the subdeterminant that one obtains by removing row $i$ and column $\alpha$ from the original determinant. The sign of $\Delta_{i\alpha}$ is given by $(-1)^{i+\alpha}$. Rather than giving a general proof that $A_{i\alpha} \propto \Delta_{i\alpha}$, we show it for a specific case in order to make the notation less overwhelming. Consider the case with $n = 3$ and the equation for $\alpha = i = 1$:
\[
\sum_{j=1}^{3}(V_{ij} - \omega_\alpha^2 T_{ij})\,A_{j\alpha} = 0. \tag{6.17}
\]
We write this out as
\[
(V_{11} - \omega_1^2 T_{11})A_{11} + (V_{12} - \omega_1^2 T_{12})A_{21} + (V_{13} - \omega_1^2 T_{13})A_{31} = 0. \tag{6.18}
\]
Now compare this expression with the equation we obtain from $|V - \omega_\alpha^2 T| = 0$ by setting $\alpha = 1$ and expanding the determinant along the 1st column:
\[
(V_{11} - \omega_1^2 T_{11})\Delta_{11} + (V_{21} - \omega_1^2 T_{21})\Delta_{21} + (V_{31} - \omega_1^2 T_{31})\Delta_{31} = 0. \tag{6.19}
\]
Since $V$ and $T$ both are symmetric, we may exchange the indices $i$ and $j$ in the equation with the $A$'s, so that (6.18) has the same coefficients as (6.19). By afterwards writing $A_{i\alpha} = C_\alpha\Delta_{i\alpha}$, where $C_\alpha$ is a complex constant of proportionality, we arrive at
\[
C_1\left[(V_{11} - \omega_1^2 T_{11})\Delta_{11} + (V_{21} - \omega_1^2 T_{21})\Delta_{21} + (V_{31} - \omega_1^2 T_{31})\Delta_{31}\right] = 0, \tag{6.20}
\]
which is precisely the characteristic equation, obtained by using $A_{i\alpha} \propto \Delta_{i\alpha}$.

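We may now write the solution for $\eta_{i\alpha}$ as follows:
\[
\eta_{i\alpha} = C_\alpha\Delta_{i\alpha}\,e^{-i\omega_\alpha t}, \tag{6.21}
\]
so that the general solution for the real motion of the generalized coordinate $\eta_i$ becomes:
\[
\mathrm{Re}\,\eta_i(t) = \mathrm{Re}\sum_{\alpha=1}^{n}\eta_{i\alpha} = \mathrm{Re}\sum_{\alpha=1}^{n} C_\alpha\Delta_{i\alpha}\,e^{-i\omega_\alpha t}. \tag{6.22}
\]
The time variation of $\eta_i$ is thus a superposition of $n$ harmonic oscillations with phases and amplitudes determined by the initial conditions and frequencies fixed by the potential and kinetic energy matrices $V$ and $T$ of the system. It is convenient to here introduce new independent coordinates $\Theta_\alpha$ according to
\[
\Theta_\alpha(t) = \mathrm{Re}\{C_\alpha e^{-i\omega_\alpha t}\}, \tag{6.23}
\]
so that the solution is
\[
\mathrm{Re}\,\eta_i(t) = \sum_{\alpha=1}^{n}\Delta_{i\alpha}\,\Theta_\alpha(t). \tag{6.24}
\]
We will now show that the equations of motion for $\Theta_\alpha(t)$ are uncoupled:
\[
\ddot\Theta_\alpha + \omega_\alpha^2\,\Theta_\alpha = 0, \qquad \alpha = 1, 2, \ldots, n. \tag{6.25}
\]
We will refer to $\Theta_\alpha$ as the normal-coordinates. The system oscillates in normal mode $\alpha$ with eigenfrequency $\omega_\alpha$. To see this, let us insert the normal modes into our expressions for kinetic and potential energy:
\[
\begin{aligned}
T &= \frac{1}{2} T_{ij}\,\mathrm{Re}\,\dot\eta_i\,\mathrm{Re}\,\dot\eta_j = \frac{1}{2} T_{ij}\,\Delta_{i\alpha}\dot\Theta_\alpha\,\Delta_{j\beta}\dot\Theta_\beta, \\
V &= \frac{1}{2} V_{ij}\,\mathrm{Re}\,\eta_i\,\mathrm{Re}\,\eta_j = \frac{1}{2} V_{ij}\,\Delta_{i\alpha}\Theta_\alpha\,\Delta_{j\beta}\Theta_\beta.
\end{aligned} \tag{6.26}
\]
Note that we must take the real part of $\eta_i$ and $\eta_j$ before multiplying instead of after multiplying, since it is $\mathrm{Re}\{\eta_i\}$ that corresponds to the true physical motion (taking the real part after multiplication would give a spurious contribution from the product of the imaginary parts of $\eta_i$ and $\eta_j$). The expression for $V$ can be rewritten by using that:
\[
V_{ij}A_{j\alpha} = \omega_\alpha^2 T_{ij}A_{j\alpha} \;\Rightarrow\; V_{ij}\Delta_{j\alpha} = \omega_\alpha^2 T_{ij}\Delta_{j\alpha}. \tag{6.27}
\]
Now, we wish to see if we can manipulate the factor $\sum_{i,j=1}^{n} T_{ij}\Delta_{i\alpha}\Delta_{j\beta}$ somehow, since it appears both in the expression for $T$ and $V$. We have noted previously that the following equation is satisfied:
\[
\sum_{j=1}^{n}(V_{ij} - \omega_\alpha^2 T_{ij})\,A_{j\alpha} = 0. \tag{6.28}
\]
Let us write this equation both for $\alpha$ and $\beta$:
\[
\begin{aligned}
\sum_j V_{ij}A_{j\alpha} &= \omega_\alpha^2\sum_j T_{ij}A_{j\alpha}, \\
\sum_j V_{ij}A_{j\beta} &= \omega_\beta^2\sum_j T_{ij}A_{j\beta}.
\end{aligned} \tag{6.29}
\]
By using $A_{j\alpha} = C_\alpha\Delta_{j\alpha}$ and multiplying the lower line of Eq. (6.29) with $\sum_i\Delta_{i\alpha}$, we get
\[
\sum_{ij}\Delta_{i\alpha}V_{ij}\Delta_{j\beta} = \sum_{ij}\Delta_{i\alpha}\,\omega_\beta^2\,T_{ij}\Delta_{j\beta}. \tag{6.30}
\]
By using that $V_{ij} = V_{ji}$ and similarly for $T$, Eqs. (6.29) and (6.30) can now be combined into
\[
(\omega_\alpha^2 - \omega_\beta^2)\sum_{ij}T_{ij}\Delta_{i\alpha}\Delta_{j\beta} = 0. \tag{6.31}
\]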

ij 

It is clear that if $\omega_\alpha \neq \omega_\beta$, it is the sum that has to be zero. However, if $\alpha = \beta$, then the sum does not have to be zero.

From our definition of $\Delta_{i\alpha}$ as the minor $(i, \alpha)$ of the determinant $|V - \omega_\alpha^2 T|$, we are not allowed to choose one particular value for $\sum_{ij}T_{ij}\Delta_{i\alpha}\Delta_{j\alpha}$, since both $\Delta$'s are uniquely defined. However, let us modify our previous statement slightly and instead say that $\Delta_{i\alpha}$ is proportional to the minor. This is justified as follows. We showed previously that $A_{i\alpha}$ is proportional to the minor via a complex coefficient $C_\alpha$. Thus, it should work equally well to set
\[
A_{i\alpha} = C_\alpha'\,\Delta_{i\alpha} \tag{6.32}
\]
with new coefficients of proportionality $C_\alpha'$ and then saying that $\Delta_{i\alpha}$ is just proportional to the minor. So what is the gain in doing this? The point is that we now have the freedom to choose a specific value for $\sum_{ij}T_{ij}\Delta_{i\alpha}\Delta_{j\alpha}$. In other words, we can choose the normalization of $\Delta_{i\alpha}$. A particularly convenient choice is:
\[
\sum_{ij}T_{ij}\Delta_{i\alpha}\Delta_{j\beta} = \delta_{\alpha\beta}. \tag{6.33}
\]
The purpose of this choice for normalization is that we can now simplify the expressions for $T$ and $V$ expressed in terms of the normal-coordinates. We obtain:
\[
\begin{aligned}
T &= \frac{1}{2}\sum_{\alpha\beta}\sum_{ij}T_{ij}\Delta_{i\alpha}\Delta_{j\beta}\,\dot\Theta_\alpha\dot\Theta_\beta = \frac{1}{2}\sum_\alpha\dot\Theta_\alpha^2, \\
V &= \frac{1}{2}\sum_{\alpha\beta}\sum_{ij}T_{ij}\Delta_{i\alpha}\Delta_{j\beta}\,\omega_\beta^2\,\Theta_\alpha\Theta_\beta = \frac{1}{2}\sum_\alpha\omega_\alpha^2\Theta_\alpha^2.
\end{aligned} \tag{6.34}
\]
The Lagrange function $L = T - V$ then becomes, expressed in terms of the normal-coordinates as independent variables,
\[
L = \frac{1}{2}\sum_\alpha\left(\dot\Theta_\alpha^2 - \omega_\alpha^2\Theta_\alpha^2\right). \tag{6.35}
\]
Lagrange's equations $(d/dt)(\partial L/\partial\dot\Theta_\alpha) - \partial L/\partial\Theta_\alpha = 0$ then take the form:
\[
\ddot\Theta_\alpha + \omega_\alpha^2\,\Theta_\alpha = 0. \tag{6.36}
\]
This is a remarkable result: the equations are now completely decoupled and can be solved once the eigenfrequencies $\omega_\alpha$ have been determined, by using appropriate initial conditions. Note that if two or more of the eigenfrequencies $\omega_\alpha$ are degenerate, the procedure becomes slightly modified.


B. Application to a triatomic linear symmetric molecule (CO2)

Consider the CO2 molecule where the atoms are linearly arranged (O=C=O), see figure below. In equilibrium, the distance
between neighboring atoms is written as

$$ x_{01} - x_{02} = x_{02} - x_{03} = b. $$ (6.37)

The deviation from equilibrium can thus be written as

$$ \eta_i = x_i - x_{0i}, \quad i = 1, 2, 3. $$ (6.38)

Assume that only the nearest neighbors interact, via a harmonic potential:

$$ V = \frac{1}{2}k[(x_1 - x_2) - b]^2 + \frac{1}{2}k[(x_2 - x_3) - b]^2 = (k/2)(\eta_1 - \eta_2)^2 + (k/2)(\eta_2 - \eta_3)^2. $$ (6.39)

The kinetic energy is

$$ T = (m/2)\dot{x}_1^2 + (m/2)\dot{x}_3^2 + (M/2)\dot{x}_2^2 = (m/2)(\dot{\eta}_1^2 + \dot{\eta}_3^2) + (M/2)\dot{\eta}_2^2. $$ (6.40)

The Lagrange function is L = T - V as usual. We can simplify the situation by assuming that the center of mass is at rest,
which still allows the atoms to vibrate relative to each other. The center of mass position thus satisfies:

$$ m(\eta_1 + \eta_3) + M\eta_2 = 0. $$ (6.41)


Let us define the new coordinates

$$ Q_a = \eta_1 + \eta_3, \quad Q_s = \eta_1 - \eta_3. $$ (6.42)

It will later transpire why the subscripts 'a' and 's' are suitable for these coordinates. In terms of the new coordinates, we obtain:

$$ \eta_1 = (Q_a + Q_s)/2, \quad \eta_3 = (Q_a - Q_s)/2, \quad \eta_2 = -mQ_a/M, $$ (6.43)

by using the fact that the center of mass is at rest. By expressing L in terms of $Q_a$ and $Q_s$ instead of the $\eta_i$ quantities, we get
after some algebra:

$$ L = \frac{m\mu}{4M}\dot{Q}_a^2 + \frac{m}{4}\dot{Q}_s^2 - \frac{k}{4}\left[\left(\frac{\mu}{M}\right)^2 Q_a^2 + Q_s^2\right]. $$ (6.44)

We introduced the total mass $\mu = M + 2m$. An important observation is that there are no cross-terms between $Q_a$ and $Q_s$ in the
above expression. This means that $Q_a$ and $Q_s$ are in fact the normal coordinates of the system: their corresponding Lagrange
equations are decoupled. In fact, we can immediately identify the eigenfrequencies by first introducing the rescaled quantities

$$ \Theta_s = \sqrt{m/2}\, Q_s, \quad \Theta_a = \sqrt{m\mu/(2M)}\, Q_a, $$ (6.45)

which gives

$$ L = \frac{1}{2}\left[\dot{\Theta}_s^2 + \dot{\Theta}_a^2 - \frac{k}{m}\Theta_s^2 - \frac{k\mu}{mM}\Theta_a^2\right]. $$ (6.46)

Based on the general form of the Lagrange equations for normal-coordinates derived previously, we see that the eigenfrequencies
are:

$$ \omega_s = \sqrt{k/m}, \quad \omega_a = \sqrt{k\mu/(mM)}. $$ (6.47)
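The absence of cross-terms can be checked numerically. The sketch below evaluates L = T - V once from the displacements via Eqs. (6.39) and (6.40) and once from the coordinates $Q_a$, $Q_s$, and compares the two. The function names and the parameter values are illustrative assumptions, not part of the original notes.

```python
def eta_from_Q(Qa, Qs, m, M):
    """Eq. (6.43): displacements (eta_1, eta_2, eta_3) with the center of mass at rest."""
    return ((Qa + Qs) / 2.0, -m * Qa / M, (Qa - Qs) / 2.0)

def lagrangian_eta(eta, eta_dot, k, m, M):
    """L = T - V written with the displacements, Eqs. (6.39) and (6.40)."""
    T = 0.5 * m * (eta_dot[0] ** 2 + eta_dot[2] ** 2) + 0.5 * M * eta_dot[1] ** 2
    V = 0.5 * k * ((eta[0] - eta[1]) ** 2 + (eta[1] - eta[2]) ** 2)
    return T - V

def lagrangian_Q(Qa, Qs, Qa_dot, Qs_dot, k, m, M):
    """The same Lagrangian written with Q_a and Q_s only: no cross-terms appear."""
    mu = M + 2.0 * m  # total mass
    T = m * mu / (4.0 * M) * Qa_dot ** 2 + m / 4.0 * Qs_dot ** 2
    V = k / 4.0 * ((mu / M) ** 2 * Qa ** 2 + Qs ** 2)
    return T - V

# Arbitrary test values in arbitrary units (assumed for illustration only).
k, m, M = 2.0, 16.0, 12.0
Qa, Qs, Qa_dot, Qs_dot = 0.3, -0.7, 1.1, 0.4

# The map (Q_a, Q_s) -> eta is linear and time-independent, so velocities transform the same way.
L_eta = lagrangian_eta(eta_from_Q(Qa, Qs, m, M), eta_from_Q(Qa_dot, Qs_dot, m, M), k, m, M)
L_Q = lagrangian_Q(Qa, Qs, Qa_dot, Qs_dot, k, m, M)
```

The two values agree to rounding error for any choice of inputs, which is what one expects if the change of variables has been carried out correctly.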


Now, we did "cheat" a little bit by introducing the normal-coordinates right away. Under usual circumstances, it can be
difficult to guess which linear combinations of the original coordinates provide the normal-coordinates of the system.
However, an important clue was that all cross-terms between $Q_a$ and $Q_s$ vanished. Thus, one viable strategy is to
look for a coordinate transformation which removes all cross-terms in the Lagrange function: the new coordinates obtained in
this way should be the normal-coordinates.

If we didn’t know this, we could still find the eigenfrequencies by solving the secular equation $|V - \omega^2 T| = 0$. The
elements $V_{ij}$ and $T_{ij}$ are determined from the general form

$$ T = \frac{1}{2}\sum_{ij} T_{ij}\dot{\eta}_i\dot{\eta}_j, \quad V = \frac{1}{2}\sum_{ij} V_{ij}\eta_i\eta_j. $$ (6.48)

Comparing with our expressions for T and V for the present triatomic molecule, one finds

$$ T = \begin{pmatrix} m & 0 & 0 \\ 0 & M & 0 \\ 0 & 0 & m \end{pmatrix}, \quad V = \begin{pmatrix} k & -k & 0 \\ -k & 2k & -k \\ 0 & -k & k \end{pmatrix}. $$ (6.49)

Evaluating the determinant gives the following result:

$$ \omega^2 (k - \omega^2 m)(k\mu - \omega^2 mM) = 0. $$ (6.50)

This equation has three solutions:

$$ \omega_1 = 0, \quad \omega_2 = \omega_s, \quad \omega_3 = \omega_a. $$ (6.51)
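As a cross-check of Eq. (6.50), the secular determinant can be evaluated numerically at the three claimed eigenfrequencies. This is a minimal sketch using only the Python standard library. The helper names and the values of k, m and M are assumptions chosen for illustration.

```python
import math

def det3(a):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (a[0][0] * (a[1][1] * a[2][2] - a[1][2] * a[2][1])
            - a[0][1] * (a[1][0] * a[2][2] - a[1][2] * a[2][0])
            + a[0][2] * (a[1][0] * a[2][1] - a[1][1] * a[2][0]))

def secular_determinant(omega2, k, m, M):
    """|V - omega^2 T| for the matrices of the linear triatomic molecule."""
    V = [[k, -k, 0.0], [-k, 2.0 * k, -k], [0.0, -k, k]]
    T = [[m, 0.0, 0.0], [0.0, M, 0.0], [0.0, 0.0, m]]
    return det3([[V[i][j] - omega2 * T[i][j] for j in range(3)] for i in range(3)])

# Illustrative parameters in arbitrary units: outer atoms of mass m, central atom of mass M.
k, m, M = 1.0, 16.0, 12.0
mu = M + 2.0 * m                        # total mass
omega_s = math.sqrt(k / m)              # symmetric mode
omega_a = math.sqrt(k * mu / (m * M))   # antisymmetric mode

# The determinant should vanish (to rounding error) at omega = 0, omega_s and omega_a.
residuals = [secular_determinant(w * w, k, m, M) for w in (0.0, omega_s, omega_a)]
```

All three residuals vanish up to floating-point error, whereas a generic value of $\omega^2$ gives a non-zero determinant.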

Hang on - when we introduced the normal-coordinates, we found only two eigenfrequencies. Why are there three now? Before
continuing to read, take a moment to consider this question. Is there some difference in the assumptions we made when using the
normal-coordinate method and the calculation of the determinant? The answer is yes: when expressing the Lagrange function
with the normal-coordinates we made the simplifying assumption that the center of mass was at rest. In contrast, we did not
make that assumption when finding the eigenfrequencies from the determinant and hence we found one more solution: $\omega_1 = 0$,
which corresponds precisely to uniform translational motion of the entire molecule (no relative motion between the atoms)! The
two other solutions $\omega_2$ and $\omega_3$ are the same ones as we found before. Now, $\omega_2 = \omega_s$ is the so-called symmetric mode which
corresponds to the two oxygen atoms oscillating in phase toward the carbon atom (thus moving in opposite directions at all
times) whereas $\omega_3 = \omega_a$ is the antisymmetric mode where the two oxygen atoms move in the same direction while the carbon
atom moves in the opposite direction of the oxygen atoms. Note that in both cases, the center of mass is at rest.

VII. THE THEORY OF SPECIAL RELATIVITY

Youtube-videos. 33-43 in this playlist. 

Learning goals. After reading this chapter, the student should: 

• Understand the concepts of Lorentz-transformations, Lorentz-invariance, and Minkowski-space 

• Be familiar with 4-vectors and their mathematical properties 

• Be able to solve elementary problems in relativistic mechanics, such as scattering of particles 

• Be able to explain the concept of threshold energy and know how EM fields transform relativistically


A. Introductory remarks 


Assume that we are considering a system whose size is characterized by a length L. Let $\lambda = h/p$ be the de Broglie wavelength
for a body in the system moving with momentum p (h is Planck’s constant). Finally, v is a typical velocity for a body in the
system while c is the speed of light. With these definitions, we can characterize the realm of physics by four quadrants, as
depicted in the figure below.


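[Figure: the realm of physics divided into four quadrants. Upper left: $\lambda \sim L$, $v \ll c$, non-rel. QM (Schrödinger eq.). Upper right: $\lambda \sim L$, $v \sim c$, relativistic QM (Dirac eq.). Lower left: $\lambda \ll L$, $v \ll c$, non-rel. classical mech. (Newton, Galilei). Lower right: $\lambda \ll L$, $v \sim c$, rel. classical mech. (Einstein eq.).]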


In this chapter, we will consider systems in the lower right quadrant where the typical size of the system far exceeds the de
Broglie wavelength of particles in it, but where the velocities under consideration are comparable to the speed of light. This
is the realm of Einstein’s special theory of relativity. For accelerated (non-inertial) frames and for a more thorough treatment of
gravitation in relativistic physics, one must turn to Einstein’s general theory of relativity. Although of interest, we will here not
focus on early experiments (Michelson-Morley), philosophical perspectives on relativity, and apparent paradoxes. Instead, our
goal is to use the formalism and toolbox that we have developed so far in classical mechanics to describe the special theory of
relativity.

A concept we will frequently draw upon is that of an inertial frame. Such a frame is defined as one in which Newton’s first law is
valid: objects move in a straight line with constant velocity unless acted upon by some force. This is e.g. not the case in
accelerated frames, where there exist fictitious forces such as the Coriolis force. All inertial frames are thus in a state of constant,
linear motion relative to each other. In order to swap between the coordinates of two inertial frames S and S’, where S’ is moving
with velocity v relative to S, you might be accustomed to using the Galilei transformation:

$$ \mathbf{r}' = \mathbf{r} - \mathbf{v}t, \quad t' = t. $$ (7.1)

There is a problem with this transformation. It predicts that the speed of light should be different in S’ and S: $c' = c - v$.
However, it is experimentally known that the speed of light is the same in S’ and S, $c' = c$, and thus independent of the relative
velocity between the inertial frames. The two basic postulates of Einstein’s theory of special relativity are:

• The laws of physics are the same in all inertial frames. This is called the principle of relativity, which is also satisfied in 
Newtonian mechanics. 

• The speed of light in vacuum has the same constant value in all inertial frames and is independent of the motion of the
light-source.


B. Lorentz transformations 

Two inertial frames S and S’ are shown in the figure. S’ is moving with a velocity v relative to S. Assume that their origins coincide
at $t = t' = 0$ and that $\mathbf{v} \parallel \hat{z}$. The Lorentz-transformation is then defined as follows:

$$ x' = x, \quad y' = y, \quad z' = \gamma(z - vt), \quad t' = \gamma(t - vz/c^2), $$
$$ \gamma = 1/\sqrt{1 - \beta^2}, \quad \beta = v/c. $$ (7.2)

The inverse transformation is obtained by $v \to -v$. Note that the transformation equations are linear. The Lorentz-transformation
is designed specifically to make the speed of light c invariant, i.e. the same seen from both systems S and S’. Let
us prove this important result explicitly.

Assume that there is a source at the origin of S’ that emits a light-wave at t = 0. The equation for the light-wave is then:

$$ r' = ct' \;\to\; (x')^2 + (y')^2 + (z')^2 = c^2 (t')^2. $$ (7.3)

If we now express $(x', y', z', t')$ in terms of $(x, y, z, t)$ via Eq. (7.2) and insert the resulting expressions in Eq. (7.3), one obtains

$$ x^2 + y^2 + z^2 = c^2 t^2. $$ (7.4)

In effect, the equation for the light-wave in S is identical to that in S’. We see that the light-wave moves with velocity c both in S
and S’. Note that in the non-relativistic limit $v \ll c$, the Lorentz-transformation Eq. (7.2) reduces to the Galilei transformation.
For an arbitrary relative velocity vector v between S and S’, the transformation becomes:

$$ \mathbf{r}' = \mathbf{r} + (\gamma - 1)\frac{(\boldsymbol{\beta}\cdot\mathbf{r})\boldsymbol{\beta}}{\beta^2} - \gamma ct\boldsymbol{\beta}, $$
$$ t' = \gamma t - \frac{\gamma}{c}\boldsymbol{\beta}\cdot\mathbf{r}, $$ (7.5)

where $\boldsymbol{\beta} = \mathbf{v}/c$.

Due to the fact that the Lorentz-transformation is linear, we can use the same framework as previously done for rotations. We
introduce a "fourth dimension" via the variable $x_4 = ict$ so that the 4-dimensional Minkowski-space has axes

$$ x_1 = x, \quad x_2 = y, \quad x_3 = z, \quad x_4 = ict. $$ (7.6)

We see that

$$ x^2 + y^2 + z^2 - c^2t^2 = x_1^2 + x_2^2 + x_3^2 + x_4^2 = \sum_{\mu=1}^{4} x_\mu x_\mu \equiv x_\mu x_\mu, $$ (7.7)

where we used the sum convention for repeated indices. A vector in Minkowski-space is known as a 4-vector and describes
an event taking place at a given position in time and space. To distinguish more clearly between 4-vectors and conventional
3-vectors, we will use Greek indices for 4-vectors ($\alpha, \beta, \mu, \ldots$) and Roman indices for 3-vectors ($i, j, k, \ldots$). The invariance
of the speed of light can now be expressed by stating that $x_\mu x_\mu$ is invariant: it has the same value in all inertial frames. In
other words, the norm of the "position vector" $x_\mu$ in Minkowski-space remains invariant under a Lorentz-transformation. The
Lorentz-transformation is an orthogonal transformation in Minkowski-space since it does not alter the norm of the 4-vector. This
is in complete analogy to rotations in usual 3D space which also are orthogonal transformations since they do not change the
length of a 3-vector.

We can also write the Lorentz-transformation in matrix form:

$$ X' = LX \quad \text{or} \quad x'_\mu = L_{\mu\nu} x_\nu, $$ (7.8)

where X denotes a 4-vector, i.e. X = (x, y, z, ict). Choosing the relative velocity along the z-axis gives the matrix:

$$ L = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & \gamma & i\beta\gamma \\ 0 & 0 & -i\beta\gamma & \gamma \end{pmatrix}. $$ (7.9)

For a velocity v in an arbitrary direction ($\boldsymbol{\beta} = \mathbf{v}/c$), one has

$$ L_{jk} = \delta_{jk} + (\gamma - 1)\beta_j\beta_k/\beta^2, \quad L_{j4} = i\gamma\beta_j, \quad L_{4k} = -i\gamma\beta_k, \quad L_{44} = \gamma. $$ (7.10)

From our previous treatment of linear transformations, we know that an orthogonal matrix has the property $L^{-1} = L^T$ where T
denotes the matrix transpose. The inverse Lorentz-transformation can thus be written

$$ x_\mu = (L^{-1})_{\mu\nu} x'_\nu = L_{\nu\mu} x'_\nu. $$ (7.11)
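The orthogonality property can be verified numerically for a boost in an arbitrary direction. The following sketch builds the complex 4x4 matrix from the components in Eq. (7.10) and checks that the transpose times the matrix is the identity. The helper names `boost_matrix`, `matmul` and `transpose` are hypothetical, and the routine assumes a non-zero velocity with $|\boldsymbol{\beta}| < 1$.

```python
import math

def boost_matrix(beta):
    """Complex 4x4 Lorentz matrix of Eq. (7.10); beta = v/c has three components, 0 < |beta| < 1."""
    b2 = sum(b * b for b in beta)
    gamma = 1.0 / math.sqrt(1.0 - b2)
    L = [[0j] * 4 for _ in range(4)]
    for j in range(3):
        for k in range(3):
            L[j][k] = (1.0 if j == k else 0.0) + (gamma - 1.0) * beta[j] * beta[k] / b2
        L[j][3] = 1j * gamma * beta[j]    # L_{j4}
        L[3][j] = -1j * gamma * beta[j]   # L_{4j}
    L[3][3] = gamma
    return L

def matmul(A, B):
    """Product of two 4x4 matrices stored as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(4)) for j in range(4)] for i in range(4)]

def transpose(A):
    """Plain transpose (no complex conjugation), as required for a complex orthogonal matrix."""
    return [list(row) for row in zip(*A)]

L = boost_matrix([0.3, -0.2, 0.5])
product = matmul(transpose(L), L)   # should be the 4x4 identity
max_dev = max(abs(product[i][j] - (1.0 if i == j else 0.0))
              for i in range(4) for j in range(4))

Lz = boost_matrix([0.0, 0.0, 0.6])  # boost along z, to compare with Eq. (7.9)
```

Note that the transpose is taken without complex conjugation: with the convention $x_4 = ict$ the Lorentz matrix is complex orthogonal rather than unitary.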


Sometimes, it is convenient to represent Lorentz-transformations as rotations with a complex angle. To see how this works,
recall that the matrix

$$ \begin{pmatrix} \cos\phi & \sin\phi \\ -\sin\phi & \cos\phi \end{pmatrix} $$ (7.12)

represents a rotation by an angle $\phi$ in the two-dimensional plane. Thus, the Lorentz-transformation in Eq. (7.9) is equivalent to
a rotation in the $x_3$-$x_4$ plane with a complex angle $\phi$ determined by

$$ \gamma = \cos\phi, \quad i\beta\gamma = \sin\phi. $$ (7.13)

When is this useful? It comes in handy when performing multiple successive transformations. Consider for instance a scenario
where there are three inertial frames. S’ is moving with velocity v relative to S and S” is moving with velocity v' relative to S’. The
goal is to find the velocity v'' with which system S” moves relative to S. We know that X' = LX and X'' = L'X'. Thus, we may write
X'' = L''X if L'' = L'L. The point is then that the "rotation" $\phi''$ in Minkowski-space taking us from S to S” must be equal to the
total angle of rotation $\phi + \phi'$ in going from S to S’ to S” (both velocities being along the z-axis). Thus, we have $\phi'' = \phi' + \phi$. We obtain

$$ \tan\phi'' = \frac{\sin\phi''}{\cos\phi''} = i\beta''. $$ (7.14)

But we also have that:

$$ \tan\phi'' = \tan(\phi' + \phi) = \frac{\sin(\phi' + \phi)}{\cos(\phi' + \phi)} = \frac{\tan\phi' + \tan\phi}{1 - \tan\phi\tan\phi'}. $$ (7.15)

Combining the two equations yields:

$$ i\beta'' = i\,\frac{\beta' + \beta}{1 + \beta\beta'} \;\to\; v'' = \frac{v' + v}{1 + vv'/c^2}. $$ (7.16)

This is Einstein’s addition formula. We see that $v'' \neq v' + v$, which would have been the non-relativistic result, and also that $v'' < c$
even when $v$ and $v'$ are both close to $c$.
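The addition formula is easy to explore numerically. In the sketch below, `add_velocities` implements Eq. (7.16) directly, while `add_via_rapidity` uses the fact that $\tan\phi = i\beta$ implies $\phi = i\,\mathrm{artanh}\,\beta$, so that adding the complex angles amounts to adding the real quantities artanh(v/c). Both function names are illustrative, and c = 1 is an assumed choice of units.

```python
import math

def add_velocities(v, v_prime, c=1.0):
    """Einstein's addition formula, Eq. (7.16), for collinear velocities."""
    return (v + v_prime) / (1.0 + v * v_prime / c ** 2)

def add_via_rapidity(v, v_prime, c=1.0):
    """Same result obtained by adding artanh(v/c), i.e. adding the complex rotation angles."""
    return c * math.tanh(math.atanh(v / c) + math.atanh(v_prime / c))

fast = add_velocities(0.9, 0.9)          # two velocities of 0.9c
slow = add_velocities(1.0e-4, 2.0e-4)    # everyday regime, v << c
```

For the two velocities of 0.9c the result stays below c, while for $v \ll c$ the result is practically indistinguishable from the Galilean sum $v + v'$.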


The properties of the Lorentz-matrix can be used to classify it further. Since it is orthogonal, we know that $|L| = \pm 1$. If
the determinant is equal to +1, we say that it is a proper Lorentz-transformation, meaning that it is possible to continuously
transform it into the identity matrix (if also $L_{44} \geq 1$). If the determinant is -1, it is an improper Lorentz-transformation.
This is e.g. obtained by inverting the space-axes, but not the time-axis. If both the space- and time-axes are inverted, we have
L = diag(-1, -1, -1, -1) and thus |L| = +1.

C. Choices of metric

We here discuss some details about the choice of so-called metric. Let us first introduce the following concepts: 

• Riemann-space: characterized by coordinates $x^\mu$ (which can be real or complex). An infinitesimal path element in this
space has the square $ds^2 = g_{\mu\nu} dx^\mu dx^\nu$.

• Metric tensor: denoted g with matrix elements $g_{\mu\nu}$.

Covariant vector components have a lower index ($x_\mu$) whereas contravariant vector components have an upper index ($x^\mu$). One trick
to remember this is the following: "co is low". The metric tensor itself has the property $g_{\mu\nu} = g^{\mu\nu}$ and its effect is to raise and
lower indices. For instance, one has


$$ x_\mu = g_{\mu\nu} x^\nu, \quad x^\mu = g^{\mu\nu} x_\nu. $$ (7.17)

For the infinitesimal path element, we may then write

$$ ds^2 = dx_\mu dx^\mu. $$ (7.18)

For complex Minkowski-space, we had $x_4 = ict$. Using g = diag(1, 1, 1, 1) gives us $ds^2 = dx^2 + dy^2 + dz^2 - c^2dt^2$, which
we have seen is invariant (its value is the same in any inertial frame). For this choice of metric tensor g, we see that $x_\mu = x^\mu$
and there is no distinction between covariant and contravariant vectors. This is a handy choice, but it comes at the expense of
working with a complex variable $x_4$. This choice of metric is characterized by the fact that Tr g = 4. To see which alternatives
we have, we could instead use

$$ x^0 = ct, \quad x^1 = x, \quad x^2 = y, \quad x^3 = z. $$ (7.19)

To keep $ds^2 = dx_\mu dx^\mu$ invariant, we then need to choose either g = diag(-1, 1, 1, 1) (which has Tr g = +2) or
g = diag(1, -1, -1, -1) (which has Tr g = -2). Different choices are encountered throughout the literature.


D. Covariant 3+1 dimensional formulation 

Note that "covariant" in this context is not related to our previous discussion of 4-vector components $x_\mu$. Saying that a quantity is
Lorentz covariant means that it changes via a Lorentz transformation when going from one inertial frame to another. Scalars and
4-vectors are examples of such quantities. An equation is Lorentz covariant if it may be written in terms of Lorentz covariant quantities.

The first postulate of the special theory of relativity was that the laws of physics should be the same in all inertial frames.
Another way to put this is that physical laws have to be Lorentz covariant, i.e. they need to have the same form in all inertial
frames. For instance, if $C_\mu = D_\mu$ in one frame, then $C'_\mu = D'_\mu$ in another frame, where the primed quantities are
obtained via a Lorentz-transformation.

Let us consider a point in Minkowski-space with coordinates $x = (x_1, x_2, x_3, x_4)$. A particle moving in this space will be
described by a path called a "world line" or eigenline. A small change in the coordinates along the eigenline is described by
$dx_\mu$. We have seen that $dx_\mu dx_\mu$ is a Lorentz-invariant, i.e. it has the same value when measured in any inertial frame.
Because of this, we can define the eigentime (proper time) $\tau$ as follows:

$$ dx_\mu dx_\mu = -c^2 d\tau^2. $$ (7.20)

Now, imagine that we attach an inertial system S’ to a moving particle. The particle, as seen from S’, is then at rest. We obtain

$$ dx'_\mu = (0, 0, 0, ic\,dt') \;\to\; dx'_\mu dx'_\mu = -c^2 dt'^2. $$ (7.21)

From the definition of eigentime, we then see that in this case $dt' = d\tau$. We can now physically interpret $d\tau$: it is the time
measured by a clock moving along with the particle (hence the name eigentime).

As seen from the stationary inertial frame S, the particle above is moving with velocity u where $u^2 = (dx/dt)^2 + (dy/dt)^2 + (dz/dt)^2$. We obtain

$$ dx_\mu = (dx, dy, dz, ic\,dt) $$ (7.22)
$$ \to\; dx_\mu dx_\mu = dx^2 + dy^2 + dz^2 - c^2dt^2 $$ (7.23)
$$ -c^2 (d\tau/dt)^2 = u^2 - c^2 $$ (7.24)
$$ d\tau^2 = (1 - u^2/c^2)\,dt^2 $$ (7.25)
$$ dt = \frac{d\tau}{\sqrt{1 - u^2/c^2}} > d\tau. $$ (7.26)

This means that the clock in S (where the particle is moving) displays a time dt which is longer than the time $d\tau$ displayed on a
watch in S’ (where the particle is at rest). This is the relativistic phenomenon of time dilation: moving clocks run slower.


There exists a similar phenomenon regarding the measurement of lengths in inertial frames that are moving with respect to each
other, known as length contraction: moving objects appear shorter. To see this, consider an object with length L' as measured
in a frame S’ moving together with the object. We know that the following relation exists between the
coordinate z' in S’ and the coordinate z in a stationary frame S (assume that the relative motion occurs along the z-axis, without loss of generality): $z' = \gamma(z - vt)$, where
v is the relative velocity between S and S’. We know that $L' = z'_2 - z'_1$. Measuring the positions $z_1$ and $z_2$ of the two ends at the same time t in S, so that $L = z_2 - z_1$, gives the equation

$$ L' = \gamma L. $$ (7.27)

In effect, the object is measured to have a length L < L' and is seen from S as shorter compared to its rest-frame length measured
in S’.

Example 17. Lifetime for muons in the atmosphere. These elementary particles, belonging to the lepton family, are typically
created around $d \approx 6$ km away from Earth via cosmic rays. The lifetime of a muon in its rest frame is $\tau = 2 \times 10^{-6}$ s. Its speed
relative to Earth is $v = 2.994 \times 10^8$ m/s.





Non-relativistically, a muon would thus manage to cover a distance $L_0 = v\tau \approx 600$ m. This means that no muons should be
able to reach the surface of Earth - however, this is in strong disagreement with experimental measurements where many muons
are detected. As suspected, we have to treat the problem relativistically due to the high velocities of the muons. We can take
two perspectives:

(a) As seen from Earth. The lifetime of the muon is subject to time dilation and its lifetime, as seen from the surface of Earth,
becomes $t = \tau/\sqrt{1 - (v/c)^2} \approx 32 \times 10^{-6}$ s. Moving with its speed v it can now cover a distance $L = vt \approx 9.6$ km > d and
thus reach Earth.

(b) As seen from the muon. Earth is moving with a relative velocity v toward the muon. But the distance from the muon to Earth,
as seen from the muon, is subject to length contraction according to our explanation preceding this example. Thus, the muon
sees a distance $z = z'\sqrt{1 - \beta^2} \approx 375$ m, where $z' = d$ is the distance measured from Earth. As a result, Earth reaches the muon after a time $t = z/v \approx 1.25 \times 10^{-6}$ s $< \tau$, i.e. a
shorter time than the muon lifetime. This has been experimentally verified in Bailey et al., Physics Letters 55B, 420 (1975).
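The numbers in this example can be reproduced with a few lines of code. The sketch below assumes the rounded value $c = 3.0 \times 10^8$ m/s, which is not stated explicitly in the example but appears to be what the quoted figures are based on. The variable names are illustrative.

```python
import math

c = 3.0e8        # speed of light in m/s (rounded value, an assumption of this sketch)
v = 2.994e8      # muon speed relative to Earth in m/s
tau = 2.0e-6     # muon lifetime in its rest frame in s
d = 6.0e3        # production height as measured from Earth in m

gamma = 1.0 / math.sqrt(1.0 - (v / c) ** 2)

# (a) Earth frame: the lifetime is dilated, so the muon travels further.
t_earth = gamma * tau
range_earth = v * t_earth

# (b) Muon frame: the distance to the surface is contracted.
d_muon = d / gamma
t_muon = d_muon / v

# Non-relativistic estimate for comparison.
range_classical = v * tau
```

Both viewpoints lead to the same physical conclusion: `range_earth` exceeds `d`, and `t_muon` is shorter than `tau`. The values quoted in the text are rounded, so the computed numbers differ from them in the second or third significant digit.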


It is important to note that the path element $ds^2 = dx_\mu dx_\mu$ can be positive, negative, or zero. We say that a 4-vector
is

• Spacelike if $x_\mu x_\mu > 0$.

• Timelike if $x_\mu x_\mu < 0$.

• Lightlike if $x_\mu x_\mu = 0$.

Note that we are here using $x_\mu$ as notation for a 4-vector when it appears isolated, whereas it means the $\mu$-th component when it
appears in a summation such as $x_\mu x_\mu$. Now, why these names? To see this, consider two events described by $x_{1\mu} = (\mathbf{r}_1, ict_1)$
and $x_{2\mu} = (\mathbf{r}_2, ict_2)$. Consider the difference between these 4-vectors:

$$ X_\mu = x_{1\mu} - x_{2\mu} = (\mathbf{r}_1 - \mathbf{r}_2, ic(t_1 - t_2)), $$ (7.28)

which has the norm

$$ X_\mu X_\mu = |\mathbf{r}_1 - \mathbf{r}_2|^2 - c^2(t_1 - t_2)^2. $$ (7.29)

Choose the coordinate systems so that $\mathbf{r}_1 - \mathbf{r}_2$ is along the z-axis. In a system S’ moving with relative velocity v along the
z-axis, we have

$$ t'_1 - t'_2 = \gamma[t_1 - t_2 - v(z_1 - z_2)/c^2] = (\gamma/c)[c(t_1 - t_2) - \beta(z_1 - z_2)]. $$ (7.30)

If $X_\mu$ is spacelike, then $X_\mu X_\mu > 0$, which according to Eq. (7.29) means

$$ c|t_1 - t_2| < |z_1 - z_2|. $$ (7.31)

In this case, we can always find a velocity v < c in Eq. (7.30) so that $t'_1 - t'_2 = 0$. In other words, we can always find an inertial
frame S’ moving with velocity v relative to S where the two events are seen as simultaneous. If $X_\mu$ is timelike, we have by the
same reasoning

$$ c|t_1 - t_2| > |z_1 - z_2|, $$ (7.32)

meaning that we cannot find an inertial frame S’ where the two events occur simultaneously. Events with spacelike separation
cannot be connected by any signal moving at or below the speed of light c, since they can occur simultaneously. Timelike events, on
the other hand, can be connected by a signal moving slower than c so that the two events can influence each other.
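The classification above can be turned into a small routine. In this sketch, `classify_interval` evaluates the sign of the norm in Eq. (7.29) for a separation along the z-axis, and `simultaneity_velocity` solves $t'_1 - t'_2 = 0$ in Eq. (7.30) for the frame velocity, which gives $v = c^2 \Delta t/\Delta z$. The function names and the choice c = 1 are assumptions made for illustration.

```python
def classify_interval(dz, dt, c=1.0):
    """Classify the separation X = (dz, i c dt) by the sign of X_mu X_mu, Eq. (7.29)."""
    norm = dz ** 2 - (c * dt) ** 2
    if norm > 0:
        return "spacelike"
    if norm < 0:
        return "timelike"
    return "lightlike"

def simultaneity_velocity(dz, dt, c=1.0):
    """Velocity of the frame S' in which the two events are simultaneous, from Eq. (7.30)."""
    if classify_interval(dz, dt, c) != "spacelike":
        raise ValueError("no frame with v < c makes these events simultaneous")
    return c ** 2 * dt / dz

kind = classify_interval(2.0, 1.0)          # |dz| > c|dt|
v_sim = simultaneity_velocity(2.0, 1.0)     # frame velocity in units of c
```

For a spacelike separation the returned velocity always satisfies |v| < c, in line with Eq. (7.31). For timelike or lightlike separations the routine raises an error instead of returning an unphysical velocity.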


Before moving on to discuss relativistic mechanics, we introduce some additional 4-vectors besides the position 4-vector $x_\mu$.
The 4-velocity is defined as

$$ u_\mu = \frac{dx_\mu}{d\tau}. $$ (7.33)

Since $x_\mu$ is a 4-vector and $d\tau$ is a Lorentz-invariant scalar, $u_\mu$ is also a 4-vector. Since $dx_\mu = (dx_i, ic\,dt)$, one obtains

$$ u_\mu = \gamma(\mathbf{v}, ic). $$ (7.34)

Note that $u_\mu u_\mu = \gamma^2 v^2 - \gamma^2 c^2 = -c^2$.


The 4-current density $j_\mu$ is defined as $j_\mu = (\mathbf{j}, ic\rho)$ where $\mathbf{j}$ is the current density and $\rho$ is the charge density. The continuity
equation, guaranteeing conservation of charge, has the familiar form $\nabla\cdot\mathbf{j} + \partial\rho/\partial t = 0$. This can be written compactly with
the 4-current density as

$$ \frac{\partial j_\mu}{\partial x_\mu} \equiv \partial_\mu j_\mu = 0. $$ (7.35)

We note that $j_\mu$ is actually the 4-vector $\rho_0 u_\mu$ where $\rho_0$ is the charge density in the inertial frame where the charges are at rest:

$$ j_\mu = \rho_0 u_\mu = (\gamma\rho_0\mathbf{v}, ic\gamma\rho_0) = (\rho\mathbf{v}, ic\rho) = (\mathbf{j}, ic\rho). $$ (7.36)

The charge density $\rho$ for moving charges is larger than $\rho_0$ due to length contraction: $\rho = \gamma\rho_0 > \rho_0$. The continuity equation
is a physical law which should have the same form in all inertial frames, i.e. it should be Lorentz covariant. Let us verify this
explicitly. Since $dx_\mu = (L^T)_{\mu\nu} dx'_\nu = L_{\nu\mu} dx'_\nu$, it follows that

$$ \frac{\partial}{\partial x'_\nu} = \frac{\partial x_\mu}{\partial x'_\nu}\frac{\partial}{\partial x_\mu} = L_{\nu\mu}\frac{\partial}{\partial x_\mu}. $$ (7.37)

Thus, both $\partial/\partial x_\mu$ and $j_\mu$ transform in the same way as $dx_\mu$ under a Lorentz-transformation and hence $\partial_\mu j_\mu$ must be an
invariant scalar. The continuity equation is thus Lorentz-covariant.


E. Maxwell’s equations, 4-potential, and electromagnetic field tensor

From two of Maxwell’s equations, $\nabla\cdot\mathbf{B} = 0$ and $\nabla\times\mathbf{E} = -\partial\mathbf{B}/\partial t$, it follows that we can express the magnetic and electric
fields via two potentials $\mathbf{A}$ and $\phi$ as follows:

$$ \mathbf{B} = \nabla\times\mathbf{A}, \quad \mathbf{E} = -\nabla\phi - \partial\mathbf{A}/\partial t. $$ (7.38)

In vacuum, we have zero electric polarization P and magnetization M, which gives $\mathbf{D} = \epsilon_0\mathbf{E}$. From the third of Maxwell’s
equations, $\nabla\cdot\mathbf{D} = \rho$, we get

$$ \nabla^2\phi + \frac{\partial}{\partial t}(\nabla\cdot\mathbf{A}) = -\rho/\epsilon_0. $$ (7.39)

Moreover, we have that $\mathbf{H} = \mathbf{B}/\mu_0$, which inserted into Maxwell’s fourth and final equation $\nabla\times\mathbf{H} = \mathbf{j} + \partial\mathbf{D}/\partial t$ gives

$$ \nabla^2\mathbf{A} - \frac{1}{c^2}\frac{\partial^2\mathbf{A}}{\partial t^2} - \nabla\left(\nabla\cdot\mathbf{A} + \frac{1}{c^2}\frac{\partial\phi}{\partial t}\right) = -\mu_0\mathbf{j}. $$ (7.40)

Let us take a step back and see what we have accomplished. Maxwell’s four equations for E and B in vacuum have now been
reduced to two equations for the potentials $\phi$ and $\mathbf{A}$, namely Eqs. (7.39) and (7.40). These equations are coupled, which makes
their solution non-trivial. However, it is actually possible to decouple these equations by using the concept of gauge-invariance.

To understand gauge-invariance in the context of EM fields, consider that since $\mathbf{B} = \nabla\times\mathbf{A}$, the physical magnetic field B
remains unchanged if we perform the transformation

$$ \mathbf{A} \to \mathbf{A}' = \mathbf{A} + \nabla\chi, $$ (7.41)

where $\chi$ is an arbitrary scalar function, since $\nabla\times\nabla\chi = 0$ is an identity. However, in doing this transformation the E field
should also be left invariant. Since $\mathbf{E} = -\nabla\phi - \partial\mathbf{A}/\partial t$, we must therefore simultaneously perform the transformation

$$ \phi \to \phi' = \phi - \partial\chi/\partial t. $$ (7.42)

This freedom to choose the potentials $\phi$ and $\mathbf{A}$ as we like while keeping the physics intact (invariant fields E and B) is known
as gauge invariance. We are thus free to choose $\chi$ as we like, which can then be utilized to make the equations simpler. For
instance, let us choose a $\chi$ that satisfies

$$ \nabla^2\chi - \frac{1}{c^2}\frac{\partial^2\chi}{\partial t^2} = 0. $$ (7.43)

This is the so-called Lorenz gauge choice (not Lorentz: the gauge choice is due to Ludvig Lorenz, to be distinguished from Hendrik
Lorentz). This gauge-choice guarantees us that the so-called Lorenz condition is satisfied:

$$ \nabla\cdot\mathbf{A} + \frac{1}{c^2}\frac{\partial\phi}{\partial t} = 0 $$ (7.44)

in any inertial frame. You can verify by direct insertion that for this choice of $\chi$, the equation $\nabla\cdot\mathbf{A}' + \frac{1}{c^2}\frac{\partial\phi'}{\partial t} = 0$ is also satisfied.

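Let us now make use of the Lorenz-condition to decouple the equations for $\phi$ and $\mathbf{A}$ in Eqs. (7.39) and (7.40). We get

$$ \nabla^2\phi + \partial(\nabla\cdot\mathbf{A})/\partial t = -\rho/\epsilon_0 $$
$$ \nabla^2\phi + \frac{\partial}{\partial t}\left(-\frac{1}{c^2}\frac{\partial\phi}{\partial t}\right) = -\rho/\epsilon_0 $$
$$ \left(\nabla^2 - \frac{1}{c^2}\frac{\partial^2}{\partial t^2}\right)\phi = -\rho/\epsilon_0. $$ (7.45)

This is the first equation, which now describes $\phi$ uncoupled from $\mathbf{A}$. The second equation is obtained by observing that the gradient
term in Eq. (7.40) is zero due to our Lorenz-gauge choice, so that it becomes

$$ \left(\nabla^2 - \frac{1}{c^2}\frac{\partial^2}{\partial t^2}\right)\mathbf{A} = -\mu_0\mathbf{j}. $$ (7.46)

We now have two decoupled equations for $\phi$ and $\mathbf{A}$. At this point, we introduce the 4-potential

$$ A_\mu = (\mathbf{A}, i\phi/c) $$ (7.47)

so that the Lorenz-condition reads $\partial_\mu A_\mu = 0$. Now, the two uncoupled wave equations Eqs. (7.45) and (7.46) can be written
compactly with 4-vector notation as one single equation

$$ \Box^2 A_\mu = -\mu_0 j_\mu. $$ (7.48)

Here, we introduced $\Box^2 = \partial_\nu\partial_\nu = \nabla^2 - \frac{1}{c^2}\frac{\partial^2}{\partial t^2}$. This is D’Alembert’s operator. Eq. (7.48) is also a Lorentz-covariant equation since
$A_\mu$ transforms like $j_\mu$ while $\Box^2$ is an invariant scalar operator. Hence, Maxwell’s electromagnetic theory is covariant.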


Let us finally consider how electromagnetic fields transform when changing inertial frame. Consider as before a system S’
moving with relative velocity v along the z-axis compared to a stationary system S. We know that the Lorentz-transformation
matrix for this scenario reads

$$ L = L_{\mu\nu} = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & \gamma & i\beta\gamma \\ 0 & 0 & -i\beta\gamma & \gamma \end{pmatrix}. $$ (7.49)

The electromagnetic field tensor $F_{\mu\nu}$ is defined as

$$ F_{\mu\nu} = \frac{\partial A_\nu}{\partial x_\mu} - \frac{\partial A_\mu}{\partial x_\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu. $$ (7.50)

An important property of this tensor is that it is antisymmetric, meaning $F_{\mu\nu} = -F_{\nu\mu}$. Here, $A_\mu = (\mathbf{A}, i\phi/c)$ is the 4-potential
introduced previously. It follows that we can also write

$$ F_{\mu\nu} = \begin{pmatrix} 0 & B_3 & -B_2 & -iE_1/c \\ -B_3 & 0 & B_1 & -iE_2/c \\ B_2 & -B_1 & 0 & -iE_3/c \\ iE_1/c & iE_2/c & iE_3/c & 0 \end{pmatrix}. $$ (7.51)

The Lorentz-transformation of $F_{\mu\nu}$ reads

$$ F'_{\mu\nu} = L_{\mu\alpha} L_{\nu\beta} F_{\alpha\beta}, $$ (7.52)

which is a natural generalization of how the single-indexed 4-vectors transform.

It is particularly instructive to consider the case $v/c \ll 1$, in which case one finds

$$ E'_1 \approx E_1 - vB_2, \quad E'_2 \approx E_2 + vB_1, \quad E'_3 = E_3. $$ (7.53)

This can be rewritten as

$$ \mathbf{E}' = \mathbf{E} + \mathbf{v}\times\mathbf{B}. $$ (7.54)

In other words, a particle moving in a constant magnetic field B will experience a net electric field E' even in the absence of
any external electric field E. A similar transformation equation can be found for the magnetic field:

$$ \mathbf{B}' = \mathbf{B} - \frac{1}{c^2}(\mathbf{v}\times\mathbf{E}). $$ (7.55)

We thus see that a particle moving in a constant electric field E will feel a net magnetic field B'. As a side-note, this is actually
the origin of the spin-orbit interaction in condensed matter physics, which is a relativistic effect. These transformation equations
are also used extensively in magnetohydrodynamics.
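The transformation of the fields can be checked by carrying out the tensor transformation numerically. The sketch below builds $F_{\mu\nu}$ from given E and B with the convention $x_4 = ict$, applies a boost along the z-axis, reads the primed fields back out, and compares them with the low-velocity expressions. All function names are hypothetical, and the units (c = 1) and field values are arbitrary assumptions.

```python
import math

def field_tensor(E, B, c=1.0):
    """Antisymmetric field tensor F_{mu nu} built from E and B, using x_4 = i c t."""
    e = [1j * Ei / c for Ei in E]
    return [[0.0, B[2], -B[1], -e[0]],
            [-B[2], 0.0, B[0], -e[1]],
            [B[1], -B[0], 0.0, -e[2]],
            [e[0], e[1], e[2], 0.0]]

def boost_z(beta):
    """Lorentz matrix for relative velocity beta * c along the z-axis."""
    g = 1.0 / math.sqrt(1.0 - beta ** 2)
    return [[1.0, 0.0, 0.0, 0.0],
            [0.0, 1.0, 0.0, 0.0],
            [0.0, 0.0, g, 1j * beta * g],
            [0.0, 0.0, -1j * beta * g, g]]

def transform(F, L):
    """F'_{mu nu} = L_{mu alpha} L_{nu beta} F_{alpha beta}."""
    idx = range(4)
    return [[sum(L[m][a] * L[n][b] * F[a][b] for a in idx for b in idx)
             for n in idx] for m in idx]

def fields(F, c=1.0):
    """Read E and B back out: F_{j4} = -i E_j / c, F_{23} = B_1, F_{31} = B_2, F_{12} = B_3."""
    E = [(1j * c * complex(F[j][3])).real for j in range(3)]
    B = [complex(F[1][2]).real, complex(F[2][0]).real, complex(F[0][1]).real]
    return E, B

c, beta = 1.0, 0.01
E, B = [1.0, 2.0, 3.0], [0.5, -1.0, 2.0]
E_prime, B_prime = fields(transform(field_tensor(E, B, c), boost_z(beta)), c)

v = beta * c
E_low = [E[0] - v * B[1], E[1] + v * B[0], E[2]]                  # E + v x B, v along z
B_low = [B[0] + v * E[1] / c ** 2, B[1] - v * E[0] / c ** 2, B[2]]  # B - (v x E)/c^2
```

The exact transformed components perpendicular to the velocity differ from the low-velocity expressions only by the overall factor $\gamma$, which is why the approximation is excellent for $v/c \ll 1$.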


F. Relativistic mechanics and kinematics 

Newton’s 2nd law F = ma is invariant under a Galilei-transformation, but not under a Lorentz-transformation. Hence, it is not
covariant and cannot be correct in the relativistic case. Thus, we have to find a generalization of this equation which

• Satisfies the criterion of covariance in the special theory of relativity

• Reduces to $\frac{d}{dt}(mv_i) = F_i$ if $v_i \ll c$.

A natural choice is

$$ \frac{d}{d\tau}(mu_\mu) = K_\mu $$ (7.56)

where m is the invariant mass, $\tau$ is the eigentime, $u_\mu$ is the 4-velocity, and $K_\mu$ is the Minkowski-force. We must demand
that $K_\mu$ has the property $\lim_{\beta\to 0} K_i = F_i$ to regain consistency with the non-relativistic limit. Let us use electromagnetic
theory as an example since we know that it is Lorentz-covariant and because we know what the force should look like, namely

$$ \mathbf{F} = q(\mathbf{E} + \mathbf{v}\times\mathbf{B}). $$

By expressing E and B with their potentials $\phi$ and $\mathbf{A}$, and using that

$$ u_\nu A_\nu = \gamma(\mathbf{v}\cdot\mathbf{A} - \phi), $$ (7.57)

one finds that the components of F may be written as:

$$ F_i = \frac{q}{\gamma}\left[\partial_i(u_\nu A_\nu) - \frac{dA_i}{d\tau}\right]. $$ (7.58)

Now, both $\partial_i$ and $dA_i$ transform as the space-components of a 4-vector, while $u_\nu A_\nu$ and $d\tau$ are invariant scalars. This allows
us to identify $K_i = \gamma F_i$ where:

$$ K_i = q\left(\partial_i(u_\nu A_\nu) - \frac{dA_i}{d\tau}\right). $$ (7.59)

Generalizing this to four dimensions, we then have the Minkowski-force

$$ K_\mu = q\left(\partial_\mu(u_\nu A_\nu) - \frac{dA_\mu}{d\tau}\right) $$ (7.60)


describing the force acting on relativistic, charged particles. The relativistic version of Newton’s 2nd law then reads:

$$ K_\mu = \frac{dp_\mu}{d\tau}, $$ (7.61)

where $p_\mu = mu_\mu$ is the 4-momentum. Thus, $\mathbf{F} = d\mathbf{p}/dt$ holds also relativistically since $\mathbf{K} = \gamma\mathbf{F} = d\mathbf{p}/d\tau$ and $d\tau = dt/\gamma$. But what
does the fourth component of $K_\mu$ tell us? We see that

$$ u_\mu K_\mu = u_\mu\frac{d(mu_\mu)}{d\tau} = \frac{d}{d\tau}\left(\frac{m u_\mu u_\mu}{2}\right) = -\frac{d}{d\tau}(mc^2/2) = 0. $$ (7.62)

In effect, $u_iK_i + u_4K_4 = 0$, providing us with

$$ K_4 = i\gamma\,\mathbf{F}\cdot\mathbf{v}/c. $$ (7.63)

The components of the Minkowski-force have now all been identified:

$$ K_\mu = \gamma(\mathbf{F}, i\,\mathbf{F}\cdot\mathbf{v}/c). $$ (7.64)

The total relativistic energy is

$$ E = \gamma mc^2. $$ (7.65)

In the limit $v \ll c$, we may expand $\gamma = 1/\sqrt{1 - (v/c)^2}$ in powers of v/c and one obtains to lowest order:

$$ E \approx mc^2 + \frac{1}{2}mv^2. $$ (7.66)

We have regained the familiar kinetic energy, but there is an additional contribution $mc^2$ that persists even if the particle is at
rest (v = 0). This is the so-called rest energy of a massive particle. If the particle in addition moves in an external potential V,
this should be added to E to obtain the total energy.


Let us consider the 4-momentum in some more detail. Just like $x_\mu x_\mu$, $p_\mu p_\mu$ is also an invariant quantity. In general, the so-called
contraction of a covariant and a contravariant 4-vector, $a_\mu b^\mu$, is always a Lorentz-invariant quantity as can be verified by performing
the transformation explicitly. For our choice of metric, there is no distinction between covariant and contravariant vectors so that
$p_\mu = p^\mu$. Thus, since $p_\mu = (\gamma m\mathbf{v}, i\gamma mc) = (\mathbf{p}, iE/c)$ where $\mathbf{p}$ is the relativistic momentum, we get

$$ p_\mu p_\mu = p^2 - E^2/c^2 = -(mc^2)^2/c^2 = -m^2c^2, $$ (7.67)

where the second equality is evaluated in the rest frame of the particle, since then p = 0 and $\gamma = 1$. Because $p_\mu p_\mu$
is a Lorentz-invariant quantity, we know that its value is the same in any inertial frame. Thus, it follows that the
following relation holds generally:

$$ E^2 = p^2c^2 + m^2c^4. $$

This is the relativistic energy-momentum dispersion relation for a free particle. It has a number of important consequences. For
instance, a photon has zero mass m = 0 and thus E = pc. As a result, it has momentum in spite of being massless. Moreover,
since $E = h\nu$ where h is Planck’s constant and $\nu$ is the photon frequency, we get $pc = h\nu$ which, using $\lambda = c/\nu$, can be rewritten as

$$ p = h/\lambda. $$ (7.68)

This is de Broglie’s formula for the wave-particle dualism.

Example 18. Energy conservation and mass difference. Consider a scenario where two masses m collide and produce a
new particle M. In the center of mass frame, the particles must initially move with velocities of equal magnitude but opposite
direction, whereas the final particle M must have v = 0. Energy and momentum conservation can now be expressed in a single
statement: conservation of total 4-momentum $P_\mu$. Now, the total 4-momentum before the collision is

$$ P_\mu = p_{1\mu} + p_{2\mu} = (\mathbf{p}_1 + \mathbf{p}_2, iE_1/c + iE_2/c). $$ (7.69)

After the collision, we have:

$$ P_\mu = (0, iMc) $$ (7.70)

since M is at rest and thus only has rest energy. It follows that conservation of the fourth component of $P_\mu$ (conservation of
energy) gives

$$ 2\gamma mc^2 = Mc^2. $$ (7.71)

In effect, M > 2m since $\gamma > 1$. The mass has increased! Relativistically, mass is thus not conserved. The fundamental reason
for this is its equivalence to energy via the relation $E = mc^2$ for the rest energy. Kinetic energy before the collision has thus
been converted to rest energy, in effect mass.


Another important situation, of particular relevance in high energy physics and particle physics, is where two particles collide/interact and form several new particles (possibly more than 2). We thus have the scenario shown in the figure below.



Without knowing the details of the moment of collision, we can still use classical mechanics on the system a sufficiently long time before and after the collision. It is beneficial to consider this process in the CM frame, where the total momentum is $\mathbf{P} = 0$ both before and after. Since 4-momentum is conserved ($P_\mu$ before, $P'_\mu = P_\mu$ after), we know that

$$P_\mu P_\mu = P'_\mu P'_\mu = (\mathbf{P}')^2 + (iE'/c)^2 = -(E'/c)^2 \equiv -M^2c^2. \tag{7.72}$$


In the last step, we introduced the so-called equivalent mass $M = E'/c^2$ of the system. It is not equal to the total mass of the particles in general: only in the special case where the momentum of each of the particles after the collision is zero (as opposed to the more general statement of their sum being zero) will $M = \sum_r m_r$, where the sum goes over all particles after the collision. This leads us to the important concept of threshold energy. It is defined as the smallest kinetic energy of the initial particles that will enable the reaction to proceed. It corresponds precisely to the case where all masses produced in the collision are at rest afterwards in the center of mass frame: no extra energy has been imparted to give them any kinetic energy.

Let us find an expression for this threshold energy, which we denote $K_1$. We know that

$$P_\mu P_\mu = (p_{1\mu} + p_{2\mu})(p_{1\mu} + p_{2\mu}) = -m_1^2c^2 - m_2^2c^2 + 2(\mathbf{p}_1\cdot\mathbf{p}_2 - E_1E_2/c^2) = -M^2c^2. \tag{7.73}$$

But we also know that $P_\mu P_\mu$ is a Lorentz-invariant quantity. Thus, we can compute it in any inertial frame and get the same result. Let us therefore now consider the lab frame where one particle is at rest to begin with, $\mathbf{p}_2 = 0$. This means that $E_2 = m_2c^2$. We obtain

$$M^2c^4 = (m_1^2 + m_2^2)c^4 + 2E_1m_2c^2 = (m_1 + m_2)^2c^4 + 2m_2c^2(E_1 - m_1c^2). \tag{7.74}$$

Now $E_1 - m_1c^2$ is precisely the kinetic energy of particle 1: its total energy minus its rest energy. Since particle 2 is at rest, it follows that the total kinetic energy before the collision is $K_1 = E_1 - m_1c^2$. From Eq. (7.74), we see that right at the threshold energy (where $M = \sum_r m_r$), we obtain

$$\Big(\sum_r m_r\Big)^2c^4 = (m_1 + m_2)^2c^4 + 2m_2c^2K_1. \tag{7.75}$$


Solving for $K_1$, we get

$$\frac{K_1}{m_1c^2} = \frac{\big(\sum_r m_r\big)^2 - (m_1 + m_2)^2}{2m_1m_2}. \tag{7.76}$$

The Q-value of the reaction is defined as the increase in mass after the collision in the CM-frame:

$$Q = \sum_r m_r - (m_1 + m_2). \tag{7.77}$$

With this definition, we get

$$\Big(\sum_r m_r\Big)^2 - (m_1 + m_2)^2 = Q^2 + 2Q(m_1 + m_2), \tag{7.78}$$

which in terms of the threshold energy then becomes:

$$\frac{K_1}{m_1c^2} = \frac{Q^2 + 2Q(m_1 + m_2)}{2m_1m_2}. \tag{7.79}$$

This equation then expresses the minimum kinetic energy $K_1$ that must be available initially in order to enable a reaction, as a function of the Q-value (mass increase) of the reaction.


Example 19. Production of antiprotons. Consider the reaction $p + N \to p + N + p + \bar{p}$. Here, $N$ is a nucleon ($n$ or $p$). The neutron and proton mass are very similar, so that we may set

$$m_p \approx m_n \approx m_{\bar{p}} = m = 938\ \mathrm{MeV}/c^2. \tag{7.80}$$

The Q-value is then $4m - 2m = 2m$. The threshold energy then becomes

$$K_1 = 6mc^2, \tag{7.81}$$

which is around 5.63 GeV, or $3Qc^2$. In other words, the initial proton must have a kinetic energy of at least $3Qc^2$ in order to enable the reaction when $N$ is at rest.

However, if we instead consider the CM-frame where $\mathbf{p}_1 = -\mathbf{p}_2$, one would have found the result $K_1 = K_2 = mc^2 = Qc^2/2$.

In other words, the threshold energy for the initial particles is now far less than in the first case. What is the physical reason 
for this? In the first case where we considered the lab-frame, conservation of momentum dictates that there must be a finite 
momentum after the reaction. In other words, the initial kinetic energy must be sufficiently high to both produce the rest mass 
energies of the final particles and to provide them with the required kinetic energy to satisfy momentum conservation. In contrast, 
if the initial collision takes place in the center of mass frame, there is no requirement that the final particles must have any kinetic 
energy: they are all allowed to be at rest. In this case, all the initial kinetic energy can be converted to rest mass energy and 
thus the threshold is lower. This is precisely the reason for why it is beneficial to experimentally accelerate particles in a ring in 
opposite directions rather than firing particles at a stationary target. 
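
The comparison between the two frames can be checked numerically. The following is a minimal sketch, not part of the original notes: the function names are hypothetical, masses are assumed to be given in MeV/c^2 so that energies come out in MeV, and the lab-frame expression is simply Eq. (7.75) solved for $K_1$.

```python
def q_value(m1, m2, final_masses):
    """Mass increase Q = sum_r m_r - (m1 + m2), in MeV/c^2."""
    return sum(final_masses) - (m1 + m2)


def lab_threshold(m1, m2, final_masses):
    """Threshold kinetic energy K_1 (MeV) of particle 1 when particle 2 is at rest.

    Uses K_1 = [Q^2 + 2 Q (m1 + m2)] c^2 / (2 m2), with masses in MeV/c^2.
    """
    q = q_value(m1, m2, final_masses)
    return (q * q + 2.0 * q * (m1 + m2)) / (2.0 * m2)


def cm_threshold_total(m1, m2, final_masses):
    """Total kinetic energy (MeV) needed in the CM frame: all of it becomes rest energy."""
    return q_value(m1, m2, final_masses)


# Antiproton production with the nucleon mass used in Example 19.
M_NUCLEON = 938.0  # MeV/c^2
final_state = [M_NUCLEON] * 4

k_lab = lab_threshold(M_NUCLEON, M_NUCLEON, final_state)
k_cm_total = cm_threshold_total(M_NUCLEON, M_NUCLEON, final_state)

print("lab-frame threshold K_1 in units of m c^2:", k_lab / M_NUCLEON)
print("CM-frame threshold per particle in units of m c^2:", k_cm_total / 2 / M_NUCLEON)
```

The ratio of the two printed numbers shows how much of the beam energy in a fixed-target experiment goes into the motion of the final particles rather than into producing new rest mass.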


G. The relativity of simultaneity 

An important conceptual point regarding relativistic transformations is the relativity of simultaneity. In short, this means that two events 1 and 2 that occur simultaneously in one frame ($t_1 = t_2$) do not occur simultaneously in another frame ($t'_1 \neq t'_2$) if the two events are separated in space ($\mathbf{r}_1 \neq \mathbf{r}_2$). This is a key aspect which must be taken into account to avoid inconsistent situations in the special theory of relativity. For instance, we have stated that moving clocks run slower. But that would mean that two persons moving relative to each other would both conclude that the other person’s clock runs slower, which seems strange. How does one resolve such an apparent paradox? The answer is that one must take into account the relativity of simultaneity.

Let us try to illustrate this with a simple example. Imagine that you have two clocks that are separated by a distance $L$. We now want to synchronize the clocks, i.e. make sure they start at the same time, by emitting a flash of light from a bulb positioned at the midpoint of the two clocks. This is fine if the system is stationary, but imagine now that the entire arrangement (both clocks and the bulb at the midpoint) is on a bus and thus moving with a velocity $v$. For a stationary observer, it is clear that the flash of light will take longer to reach the clock to the right since it is moving away from the bulb, whereas the flash of light will take a shorter amount of time to reach the clock to the left since it is moving toward the bulb. A stationary observer is thus exposed to two relativistic effects due to the moving clocks:

• Both clocks will run slower than his clock by a factor $\sqrt{1 - (v/c)^2}$ since they are moving: this is the phenomenon of time-dilation.

• But there is an additional effect: the two clocks are not synchronized as seen from the stationary frame (even if they started at the same time as seen from the bus). In fact, one can show that he will see the left clock reading $vL/c^2$ seconds at the moment the flash of light reaches the right clock, causing it to start running.

So the clocks on the bus, as seen from the stationary observer, run slower but are also not synchronized. 


[Figure: the left clock, the light bulb at the midpoint, and the right clock.]

Taking this into account, we can now show why two observers moving relative to each other can both conclude that the other person’s clock is running slower - and both be right! The following example is a rewritten version of “Time Dilation: A Worked Example” given by M. Fowler, UVa Physics. Consider the frame of Alice where there are two synchronized clocks $C_1$ and $C_2$ that are separated by a distance $L$. Now, Bob has a very fast airplane and flies by at a speed of $0.8c$ (see figure below). Bob has a clock $C_B$ on his airplane which starts at the same time as $C_1$, i.e. $C_1$ and $C_B$ are synchronized. Alice would measure that the airplane requires a time of $t = L/0.8c$ to travel from $C_1$ to $C_2$. Bob’s clock, which is moving relative to Alice, would be subject to time dilation and should thus show a time $t_B = t\sqrt{1 - (v/c)^2} = 0.6t$.

So, at the moment when Bob passes by the clock $C_2$, both Alice and Bob will agree that $C_2$ shows a time $t$ whereas $C_B$ shows a time $0.6t$. In effect, Bob’s clock is running slower, which Alice is perfectly happy with since she knows that moving clocks run slower. But what about Bob? How is this consistent with the fact that according to Bob, it is Alice’s clock that should be running slower since she is moving relative to him?

This is exactly where the relativity of simultaneity comes into play. The clock $C_1$, which is synchronized with $C_B$, is indeed running slower than $C_B$. In fact, since $C_B$ is showing the time $0.6t$, $C_1$ should be showing the time $0.6^2 t = 0.36t$. But $C_2$ should not display this time, because as seen from Bob, $C_2$ is not synchronized with $C_1$. Due to the reasoning presented in the initial example with the lightbulb on the bus, $C_1$ is behind $C_2$ by $Lv/c^2 = 0.8L/c$ seconds. Bob can thus safely state that $C_1$ is indeed running slow compared to $C_B$ as he passes the clock $C_2$, whereas Alice can safely state that $C_B$ is running slow compared to her clock $C_2$ as Bob flies by.
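
The bookkeeping in this example is easy to verify numerically. The sketch below is illustrative only: the function name is hypothetical, and all times are assumed to be measured in units of $L/c$, so that the speed enters only through $v/c$. It computes the reading of $C_1$ in Bob's frame in two independent ways, once from time dilation of Alice's clocks and once from the synchronization offset $Lv/c^2$.

```python
import math


def flyby_readings(v_over_c):
    """Clock readings (in units of L/c) at the moment Bob passes C_2."""
    dilation = math.sqrt(1.0 - v_over_c ** 2)   # sqrt(1 - (v/c)^2)
    c2_reading = 1.0 / v_over_c                 # t = L / v, shown by Alice's clock C_2
    cb_reading = c2_reading * dilation          # Bob's clock C_B, slowed as seen by Alice
    # In Bob's frame, C_1 runs slow relative to C_B ...
    c1_from_dilation = cb_reading * dilation
    # ... and C_1 lags C_2 by L v / c^2 because of the relativity of simultaneity.
    c1_from_offset = c2_reading - v_over_c
    return c2_reading, cb_reading, c1_from_dilation, c1_from_offset


c2, cb, c1_a, c1_b = flyby_readings(0.8)
print("C_2:", c2, " C_B:", cb, " C_1 (dilation):", c1_a, " C_1 (offset):", c1_b)
```

That the last two numbers agree for any speed is just the identity $L/v - Lv/c^2 = (L/v)(1 - v^2/c^2)$.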
VIII. CANONICAL TRANSFORMATIONS

Youtube-videos. 44-51 in this playlist. 

Learning goals. After reading this chapter, the student should: 

• Be familiar with phase space and the equivalence between Lagrange’s and Hamilton’s formulations

• Understand how Hamilton’s equations can be derived from a modified Hamilton’s principle 

• Understand how canonical transformations are derived from a generating function 

• Know Poisson brackets, and the related Jacobi’s identity and Poisson’s theorem 

• Know Liouville’s theorem 

• Know the essentials of the Hamilton-Jacobi theory 


Canonical transformations are related to the Hamiltonian formulation of mechanics. We saw in Chapter III that it is usually not an
advantage to use the Hamiltonian formalism instead of the Lagrangian formalism when solving specific problems in mechanics. 
The advantages with the Hamiltonian formalism are of a more fundamental kind, namely that the coordinates q and the momenta 
p are considered to be independent variables on the same level. This is quite an important point, especially in statistical mechanics 
and in quantum mechanics. 


A. Transformation of phase space 


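The phase space is spanned by $2n$ axes; $n$ axes for $q_i$ and $n$ axes for $p_i$. We remember that canonical momentum was given by

$$p_i = \frac{\partial L}{\partial \dot{q}_i} = p_i(q, \dot{q}, t), \qquad i \in \{1, 2, \ldots, n\},$$

and Hamilton’s equations were

$$\dot{q}_i = \frac{\partial H}{\partial p_i}, \qquad \dot{p}_i = -\frac{\partial H}{\partial q_i}, \qquad i = 1, 2, \ldots, n.$$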


We have already seen that we can perform “usual” coordinate transformations,

$$Q_i = Q_i(q, t),$$

for example from Cartesian coordinates $q = \{x, y\}$ to plane polar coordinates, $q = \{r, \phi\}$. A transformation of this kind is called a point transformation. More generally, and still in accordance with the Hamiltonian formulation, we can transform both $q_i$ and $p_i$, implying a transformation of the phase space:

$$Q_i = Q_i(q, p, t), \qquad P_i = P_i(q, p, t). \tag{8.1}$$

Here $Q_i$ and $P_i$ are canonical coordinates which satisfy the “Hamiltonian equations”:

$$\dot{Q}_i = \frac{\partial K}{\partial P_i}, \qquad \dot{P}_i = -\frac{\partial K}{\partial Q_i}.$$

The function $K(Q, P, t)$ is the Hamiltonian in the new coordinates, and (8.1) is a canonical transformation.

We remember from Chapter II that Lagrange’s equations can be derived from Hamilton’s principle:

$$\delta \int_{t_1}^{t_2} L(q, \dot{q}, t)\, dt = 0.$$

Correspondingly, Hamilton’s equations can be derived from a modified Hamilton’s principle:

$$\delta \int_{t_1}^{t_2} \left[ p_i \dot{q}_i - H(p, q, t) \right] dt = 0. \tag{8.2}$$


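This can be seen by starting from the variational principle

$$\delta I = \delta \int_{t_1}^{t_2} f(q, \dot{q}, p, t)\, dt = \delta \int_{t_1}^{t_2} \left[ p_i \dot{q}_i - H(p, q, t) \right] dt = 0,$$

in which $p_i$ and $q_i$ are looked upon as independent coordinates. Note that $f(q, \dot{q}, p, t) = p_i \dot{q}_i - H(p, q, t)$ does not contain $\dot{p}_i$. The variational principle $\delta I = 0$ thus leads, as we have seen, to the Euler equations for $f$, under the condition that we can set the boundary terms at $t = t_1, t_2$ equal to 0. This is straightforward for the coordinates $q_i$:

$$\left[ \frac{\partial f}{\partial \dot{q}_i}\, \delta q_i \right]_{t_1}^{t_2} = 0,$$

because $\delta q_i(t_1) = \delta q_i(t_2) = 0$. We need however also the property

$$\left[ \frac{\partial f}{\partial \dot{p}_i}\, \delta p_i \right]_{t_1}^{t_2} = 0.$$

Why is this true, since there is no condition on $\delta p_i$ at the end points? The explanation is that the function $f$ does not contain $\dot{p}$, which implies $\partial f/\partial \dot{p}_i = 0$; cf. the expression for $f$ above. The last equation is thus satisfied after all. We then obtain the Euler equations for $f$, which are the same as the usual Hamilton equations:

$$\frac{d}{dt}\frac{\partial f}{\partial \dot{q}_i} - \frac{\partial f}{\partial q_i} = 0 \quad\Longrightarrow\quad \dot{p}_i + \frac{\partial H}{\partial q_i} = 0,$$

$$\frac{d}{dt}\frac{\partial f}{\partial \dot{p}_i} - \frac{\partial f}{\partial p_i} = 0 \quad\Longrightarrow\quad \dot{q}_i - \frac{\partial H}{\partial p_i} = 0.$$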


Since Hamilton’s equations are satisfied also for $P$ and $Q$, the modified Hamilton’s principle has to hold also in this case:

$$\delta \int_{t_1}^{t_2} \left[ P_i \dot{Q}_i - K(P, Q, t) \right] dt = 0.$$

We compare with (8.2), and see that

$$p_i \dot{q}_i - H = P_i \dot{Q}_i - K + \frac{dF}{dt}, \qquad \text{where } \delta F(t_1) = \delta F(t_2) = 0. \tag{8.3}$$

Here the new function $F$ is an arbitrary function, except that it has zero variation at the end points. Therefore $dF/dt$ does not contribute to the variation of the integral.

Canonical transformations are useful in practice if F contains half of the variables from the old set and half from the new. One 
then calls F a generating function; it acts as a “bridge” between the sets (q,p) and (Q, P). When choosing the function F 
there are four possible alternatives which we will consider successively. 

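1. Alternative 1: $F = F_1(q, Q, t)$

This choice leads to

$$p_i \dot{q}_i - H = P_i \dot{Q}_i - K + \frac{dF_1}{dt} = P_i \dot{Q}_i - K + \frac{\partial F_1}{\partial t} + \frac{\partial F_1}{\partial q_i}\dot{q}_i + \frac{\partial F_1}{\partial Q_i}\dot{Q}_i.$$

Since $q_i$ and $Q_i$ are considered independent, this equation is satisfied identically only if

$$p_i = \frac{\partial F_1}{\partial q_i}, \qquad P_i = -\frac{\partial F_1}{\partial Q_i}, \qquad K = H + \frac{\partial F_1}{\partial t}. \tag{8.4}$$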


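2. Alternative 2: $F = F_2(q, P, t) - Q_i P_i$

We now get

$$p_i \dot{q}_i - H = P_i \dot{Q}_i - K + \frac{\partial F_2}{\partial t} + \frac{\partial F_2}{\partial q_i}\dot{q}_i + \frac{\partial F_2}{\partial P_i}\dot{P}_i - \dot{Q}_i P_i - Q_i \dot{P}_i = -K + \frac{\partial F_2}{\partial t} + \frac{\partial F_2}{\partial q_i}\dot{q}_i + \frac{\partial F_2}{\partial P_i}\dot{P}_i - Q_i \dot{P}_i.$$

In this case $q_i$ and $P_i$ are considered as independent coordinates, so that we obtain

$$p_i = \frac{\partial F_2}{\partial q_i}, \qquad Q_i = \frac{\partial F_2}{\partial P_i}, \qquad K = H + \frac{\partial F_2}{\partial t}. \tag{8.5}$$

Note that $F_1(q, Q, t)$ and $F_2(q, P, t)$ are related via the Legendre transformation (with $P_i = -\partial F_1/\partial Q_i$)

$$F_1 = -P_i Q_i + \underbrace{F_2(q, P, t)}_{\text{“Integration constant.”}}$$

This means that the coordinate $P$ is left out and $Q$ comes in (or vice versa).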


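3. Alternative 3: $F = q_i p_i + F_3(p, Q, t)$

The third possible choice is $F = q_i p_i + F_3(p, Q, t)$. From

$$p_i \dot{q}_i - H = P_i \dot{Q}_i - K + \frac{dF}{dt}$$

we get

$$p_i \dot{q}_i - H = P_i \dot{Q}_i - K + \dot{q}_i p_i + q_i \dot{p}_i + \frac{d}{dt}F_3(p, Q, t),$$

$$-q_i \dot{p}_i - H = P_i \dot{Q}_i - K + \frac{\partial F_3}{\partial t} + \frac{\partial F_3}{\partial p_i}\dot{p}_i + \frac{\partial F_3}{\partial Q_i}\dot{Q}_i.$$

As $p_i$ and $Q_i$ are independent in this case, we get in the same way as above:

$$q_i = -\frac{\partial F_3}{\partial p_i}, \qquad P_i = -\frac{\partial F_3}{\partial Q_i}, \qquad K = H + \frac{\partial F_3}{\partial t}. \tag{8.6}$$

4. Alternative 4: $F = q_i p_i - Q_i P_i + F_4(p, P, t)$

The same procedure as above gives now

$$p_i \dot{q}_i - H = P_i \dot{Q}_i - K + \dot{q}_i p_i + q_i \dot{p}_i - \dot{Q}_i P_i - Q_i \dot{P}_i + \frac{d}{dt}F_4(p, P, t),$$

$$-q_i \dot{p}_i - H = -Q_i \dot{P}_i - K + \frac{\partial F_4}{\partial t} + \frac{\partial F_4}{\partial p_i}\dot{p}_i + \frac{\partial F_4}{\partial P_i}\dot{P}_i,$$

and as above, since $p_i$ and $P_i$ are independent we get

$$q_i = -\frac{\partial F_4}{\partial p_i}, \qquad Q_i = \frac{\partial F_4}{\partial P_i}, \qquad K = H + \frac{\partial F_4}{\partial t}. \tag{8.7}$$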


Example 20. General transformation. Consider a transformation of type 2 above, given by $F_2 = q_i P_i$. It gives us

$$p_i = \frac{\partial F_2}{\partial q_i} = P_i, \qquad Q_i = \frac{\partial F_2}{\partial P_i} = q_i, \qquad K = H.$$

The function $F_2$ thus generates the identity transformation!

To make it a little more general, let $F_2 = f_i(q_1, q_2, \ldots, t)\, P_i$, whereby we get

$$Q_i = \frac{\partial F_2}{\partial P_i} = f_i(q, t).$$

The new coordinates depend only on the old coordinates and the time, not on the momenta. The transformation is of the type $Q_i = Q_i(q, t)$, i.e. a point transformation. Thus, a point transformation is a special case of a canonical transformation.


Example 21. Harmonic oscillator. We start with $H = p^2/2m + kq^2/2$ in “usual” coordinates (one dimension). With $\omega^2 = k/m$ we can write

$$H = \frac{1}{2m}\left(p^2 + m^2\omega^2 q^2\right).$$

If we can find a transformation

$$p = f(P)\cos Q, \qquad q = \frac{f(P)}{m\omega}\sin Q,$$

then $K = H$ becomes cyclic in the new coordinate $Q$:

$$K = H = \frac{f^2(P)}{2m}\left(\cos^2 Q + \sin^2 Q\right) = \frac{f^2(P)}{2m}.$$

We must determine $f(P)$ such that the transformation becomes canonical. With the transformation above:

$$p = m\omega q \cot Q \qquad [\text{independent of } f(P)].$$

This corresponds to $F$ of the type $F_1(q, Q)$:

$$p = \frac{\partial}{\partial q}F_1(q, Q) \quad\Longrightarrow\quad F_1 = \frac{1}{2}m\omega q^2 \cot Q \qquad (\text{simplest solution}).$$

The other half of the transformation is

$$P = -\frac{\partial F_1}{\partial Q} = \frac{m\omega q^2}{2\sin^2 Q} \quad\Longrightarrow\quad q = \sqrt{\frac{2P}{m\omega}}\,\sin Q.$$

A comparison with the original transformation equations gives:

$$f(P) = \sqrt{2m\omega P} \quad\Longrightarrow\quad H = \frac{f^2(P)}{2m} = \omega P,$$

expressed in terms of the transformed variables. Since $Q$ is a cyclic coordinate, the conjugate momentum $P$ has to be a constant. Thus we have learned that $H = E$ = total energy = constant, and we get $P = E/\omega$.

The equation of motion for $Q$ becomes

$$\dot{Q} = \frac{\partial H}{\partial P} = \omega \quad\Longrightarrow\quad Q(t) = \omega t + \alpha \qquad (\alpha \text{ to be determined by the initial conditions}).$$

The solution for $q$ becomes

$$q = \sqrt{\frac{2P}{m\omega}}\,\sin Q = \sqrt{\frac{2E}{m\omega^2}}\,\sin(\omega t + \alpha),$$

which we recognize as the usual solution for a harmonic oscillator.
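
As a quick sanity check of this example, one can verify numerically that the transformation with $f(P) = \sqrt{2m\omega P}$ preserves the fundamental bracket, i.e. that $\partial q/\partial Q\,\partial p/\partial P - \partial q/\partial P\,\partial p/\partial Q = 1$, and that the Hamiltonian reduces to $\omega P$. The sketch below is illustrative only: the parameter values and function names are arbitrary assumptions, and the derivatives are approximated by central differences.

```python
import math

M = 2.0        # mass (arbitrary illustrative value)
OMEGA = 3.0    # angular frequency (arbitrary illustrative value)


def old_from_new(Q, P):
    """Map (Q, P) -> (q, p) with f(P) = sqrt(2 m omega P)."""
    q = math.sqrt(2.0 * P / (M * OMEGA)) * math.sin(Q)
    p = math.sqrt(2.0 * M * OMEGA * P) * math.cos(Q)
    return q, p


def bracket_q_p(Q, P, h=1e-6):
    """Central-difference estimate of dq/dQ * dp/dP - dq/dP * dp/dQ."""
    q_Qp, p_Qp = old_from_new(Q + h, P)
    q_Qm, p_Qm = old_from_new(Q - h, P)
    q_Pp, p_Pp = old_from_new(Q, P + h)
    q_Pm, p_Pm = old_from_new(Q, P - h)
    dq_dQ = (q_Qp - q_Qm) / (2 * h)
    dp_dQ = (p_Qp - p_Qm) / (2 * h)
    dq_dP = (q_Pp - q_Pm) / (2 * h)
    dp_dP = (p_Pp - p_Pm) / (2 * h)
    return dq_dQ * dp_dP - dq_dP * dp_dQ


def hamiltonian(q, p):
    """H = p^2 / 2m + m omega^2 q^2 / 2 in the old variables."""
    return p * p / (2.0 * M) + 0.5 * M * OMEGA ** 2 * q * q


Q0, P0 = 0.7, 1.3
q0, p0 = old_from_new(Q0, P0)
print("bracket:", bracket_q_p(Q0, P0))
print("H:", hamiltonian(q0, p0), " omega * P:", OMEGA * P0)
```

Any other choice of $f(P)$ would still make $H$ independent of $Q$, but the bracket would then differ from 1, which is why the canonical requirement fixes $f$.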



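B. Poisson brackets

The Poisson bracket of two functions $u$ and $v$ with respect to canonical variables $q$ and $p$ is defined as

$$[u, v]_{q,p} = \sum_i \left( \frac{\partial u}{\partial q_i}\frac{\partial v}{\partial p_i} - \frac{\partial u}{\partial p_i}\frac{\partial v}{\partial q_i} \right). \tag{8.8}$$

One sees that

$$[u, v]_{q,p} = -[v, u]_{q,p}. \tag{8.9}$$

Moreover it is evident that the following relations hold:

$$[u, C] = 0 \quad \text{when } C \text{ is const.}, \tag{8.10}$$

$$[q_i, q_j]_{q,p} = [p_i, p_j]_{q,p} = 0, \tag{8.11}$$

$$[q_i, p_j]_{q,p} = \delta_{ij}. \tag{8.12}$$

Let $F = F(q_i, p_i, t)$ be an arbitrary function expressed in terms of canonical variables $q_i, p_i$. Then

$$\frac{dF}{dt} = \frac{\partial F}{\partial t} + \sum_i \left( \frac{\partial F}{\partial q_i}\dot{q}_i + \frac{\partial F}{\partial p_i}\dot{p}_i \right) = \frac{\partial F}{\partial t} + \sum_i \left( \frac{\partial F}{\partial q_i}\frac{\partial H}{\partial p_i} - \frac{\partial F}{\partial p_i}\frac{\partial H}{\partial q_i} \right),$$

where we have made use of Hamilton’s equations. Thus

$$\frac{dF}{dt} = \frac{\partial F}{\partial t} + [F, H]_{q,p}. \tag{8.13}$$

If $F$ is a constant of motion, $dF/dt = 0$, one thus has

$$\frac{\partial F}{\partial t} + [F, H]_{q,p} = 0.$$

In other words, if $F$ does not depend explicitly on $t$, the condition that $F$ is a constant of motion is that

$$[F, H]_{q,p} = 0.$$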
Other formal properties of the Poisson brackets are the following:

$$[u_1 + u_2, v]_{q,p} = [u_1, v]_{q,p} + [u_2, v]_{q,p}, \tag{8.14}$$

$$[u_1 u_2, v]_{q,p} = u_1 [u_2, v]_{q,p} + u_2 [u_1, v]_{q,p}, \tag{8.15}$$

$$\frac{\partial}{\partial t}[u, v]_{q,p} = \left[ \frac{\partial u}{\partial t}, v \right]_{q,p} + \left[ u, \frac{\partial v}{\partial t} \right]_{q,p}. \tag{8.16}$$


1. Connection with quantum mechanics 

In quantum mechanics a commutator between two operators $u$ and $v$ (remember that a function can also be regarded as an operator) is defined as

$$[u, v] = uv - vu.$$

The quantum mechanical property

$$[q_i, p_j] = i\hbar\,\delta_{ij}$$

should be well known. Together with the relation we found, $[q_i, p_j]_{q,p} = \delta_{ij}$, it seems natural to assume that the connection between quantum mechanical commutators and Poisson brackets is

$$[u, v] = i\hbar\,[u, v]_{q,p}. \tag{8.17}$$

A somewhat closer justification can be given by comparing Heisenberg’s operator equation in quantum mechanics,

$$\frac{dA}{dt} = \frac{\partial A}{\partial t} + \frac{1}{i\hbar}[A, H],$$

with equation (8.13). The equations take the same form if

$$[A, H] = i\hbar\,[A, H]_{q,p}.$$

2. Jacobi’s identity and Poisson’s theorem 

We will only mention (not derive in detail) these results. From now on we drop the subscript $q, p$. Jacobi’s identity reads

$$[u, [v, w]] + [v, [w, u]] + [w, [u, v]] = 0. \tag{8.18}$$

This identity is actually made use of quite often in quantum mechanics (the interested reader can insert the definition of Poisson brackets in the expression on the left hand side and verify that the identity holds).

Moreover, Poisson’s theorem reads:

If $F$ and $G$ are two constants of motion that do not depend on time explicitly, then the Poisson bracket $[F, G]$ will also be a constant of motion.

Proof. Insert $u = F$, $v = G$ and $w = H$ in Jacobi’s identity:

$$[H, [F, G]] + [G, [H, F]] + [F, [G, H]] = 0.$$

Since $F$ and $G$ are constants of motion and not explicitly time dependent, one has $[H, G] = 0$ and $[H, F] = 0$ from equation (8.13). Accordingly

$$[H, [F, G]] = 0,$$

which (again from equation (8.13)) implies that $[F, G]$ has to be a constant of motion. This completes the proof. □
Example 22. Poisson’s bracket between momentum and angular momentum. We want to find Poisson brackets made from the Cartesian components of the momentum vector $\mathbf{p}$ and the angular momentum vector $\mathbf{L} = \mathbf{r} \times \mathbf{p}$ for a particle. Consider

$$[p_x, L_x] = [p_x, yp_z - zp_y] = [p_x, y]\,p_z - [p_x, z]\,p_y,$$

because $[p_i, p_k] = 0$. As $[p_i, x_k] = -\delta_{ik}$, one has $[p_x, y] = 0$, $[p_x, z] = 0$ and thus $[p_x, L_x] = 0$. Just the same would of course hold if we had replaced $x$ with $y$ or $z$. We consider further two different components, for instance

$$[p_x, L_y] = [p_x, zp_x - xp_z] = \underbrace{[p_x, z]}_{=0}\,p_x - [p_x, x]\,p_z = +\underbrace{[x, p_x]}_{=1}\,p_z = p_z.$$

A cyclic interchange of the coordinates $x, y, z$ above gives us

$$[p_y, L_z] = p_x, \qquad [p_z, L_x] = p_y.$$

Altogether:

$$[p_i, L_j] = \epsilon_{ijk}\,p_k.$$

Alternatively, we can express the above results in a more compact form:

$$[p_i, L_j] = [p_i, \epsilon_{jlm}x_l p_m] = \underbrace{[p_i, x_l]}_{=-\delta_{il}}\,\epsilon_{jlm}p_m = -\epsilon_{jim}p_m = \epsilon_{ijk}\,p_k.$$

Correspondingly, we find

$$[x_i, L_j] = [x_i, \epsilon_{jlm}x_l p_m] = \epsilon_{jlm}x_l \underbrace{[x_i, p_m]}_{=\delta_{im}} = \epsilon_{jli}x_l = \epsilon_{ijk}\,x_k.$$


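Example 23. Poisson brackets between angular momentum components. We intend to find the Poisson brackets between angular momentum components. With use of the summation convention we can calculate directly

$$[L_x, L_y] = \frac{\partial L_x}{\partial x_i}\frac{\partial L_y}{\partial p_i} - \frac{\partial L_x}{\partial p_i}\frac{\partial L_y}{\partial x_i}.$$

Here

$$\frac{\partial L_x}{\partial x_i} = \frac{\partial}{\partial x_i}(yp_z - zp_y) = \delta_{yi}\,p_z - \delta_{zi}\,p_y,$$

$$\frac{\partial L_y}{\partial p_i} = \frac{\partial}{\partial p_i}(zp_x - xp_z) = z\,\delta_{xi} - x\,\delta_{zi},$$

$$\frac{\partial L_x}{\partial p_i} = \frac{\partial}{\partial p_i}(yp_z - zp_y) = y\,\delta_{zi} - z\,\delta_{yi},$$

$$\frac{\partial L_y}{\partial x_i} = \frac{\partial}{\partial x_i}(zp_x - xp_z) = \delta_{zi}\,p_x - \delta_{xi}\,p_z,$$

which gives

$$[L_x, L_y] = (\delta_{yi}p_z - \delta_{zi}p_y)(z\delta_{xi} - x\delta_{zi}) - (y\delta_{zi} - z\delta_{yi})(\delta_{zi}p_x - \delta_{xi}p_z) = xp_y - yp_x = L_z.$$

The generalized version of this is (we do not give all details here):

$$[L_i, L_j] = \epsilon_{ijk}\,L_k.$$

The equation

$$[L_i, L_j] = \epsilon_{ijk}\,L_k \tag{8.19}$$

is another example of the relation to quantum mechanics, since a closely similar relation (apart from a factor $i\hbar$) defines the algebra for angular momentum operators. One says that generators $G_i$ for a Lie group generally satisfy the relation

$$[G_i, G_j] = C_{ijk}\,G_k, \tag{8.20}$$

where $C_{ijk}$ are called the structure coefficients. In our case the rotation group in three dimensions is called SO(3), and $C_{ijk} = \epsilon_{ijk}$.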
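
The brackets of Examples 22 and 23 can also be checked numerically straight from the definition (8.8). The following is a minimal sketch under stated assumptions: the helper `poisson_bracket` and the test point are hypothetical choices, and the partial derivatives are approximated by central differences (which is exact up to rounding here, because each component of $\mathbf{L}$ is bilinear in $\mathbf{r}$ and $\mathbf{p}$).

```python
def poisson_bracket(u, v, x, p, h=1e-5):
    """Central-difference estimate of [u, v]_{x,p} at the phase-space point (x, p)."""

    def partial(f, which, i):
        # Derivative of f with respect to x_i (which == "x") or p_i (which == "p").
        x_plus, p_plus = list(x), list(p)
        x_minus, p_minus = list(x), list(p)
        if which == "x":
            x_plus[i] += h
            x_minus[i] -= h
        else:
            p_plus[i] += h
            p_minus[i] -= h
        return (f(x_plus, p_plus) - f(x_minus, p_minus)) / (2 * h)

    return sum(
        partial(u, "x", i) * partial(v, "p", i) - partial(u, "p", i) * partial(v, "x", i)
        for i in range(len(x))
    )


# Cartesian components of L = r x p and of the momentum (index 0, 1, 2 = x, y, z).
def L_x(x, p): return x[1] * p[2] - x[2] * p[1]
def L_y(x, p): return x[2] * p[0] - x[0] * p[2]
def L_z(x, p): return x[0] * p[1] - x[1] * p[0]
def p_x(x, p): return p[0]


r0 = [0.3, -1.2, 0.7]   # arbitrary test point
p0 = [1.1, 0.4, -0.9]

print("[L_x, L_y] =", poisson_bracket(L_x, L_y, r0, p0), " L_z =", L_z(r0, p0))
print("[p_x, L_y] =", poisson_bracket(p_x, L_y, r0, p0), " p_z =", p0[2])
print("[p_x, L_x] =", poisson_bracket(p_x, L_x, r0, p0))
```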


3. Canonical transformation of Poisson brackets 

Poisson brackets are invariant under a canonical transformation:

$$[u, v]_{q,p} = [u, v]_{Q,P}. \tag{8.21}$$

This can be easily seen for the harmonic oscillator, where

$$q = \sqrt{\frac{2P}{m\omega}}\,\sin Q, \qquad p = m\omega q \cot Q, \qquad P = E/\omega.$$


4. Liouville’s theorem 

The phase space is spanned by the $p_i$ and $q_i$ axes. A point in the phase space corresponds to a definite state of the physical system. When the system develops in time, this point describes a so-called phase path.

Liouville’s theorem reads:

Let $d\Gamma = dq_1 \cdots dq_n \cdot dp_1 \cdots dp_n$ be a volume element in the phase space. Under a canonical transformation an arbitrary volume of the phase space will be invariant:

$$\int dq_1 \cdots dq_n \cdot dp_1 \cdots dp_n = \int dQ_1 \cdots dQ_n \cdot dP_1 \cdots dP_n. \tag{8.22}$$

Proof. Under a transformation of variables in a multiple integral one generally has

$$\int dQ_1 \cdots dQ_n \cdot dP_1 \cdots dP_n = \int D\; dq_1 \cdots dq_n \cdot dp_1 \cdots dp_n,$$

where

$$D = \frac{\partial(Q_1 \ldots Q_n;\, P_1 \ldots P_n)}{\partial(q_1 \ldots q_n;\, p_1 \ldots p_n)}$$

is the Jacobi determinant. We have to show that for any canonical transformation one has $D = 1$. Mathematically, we can handle the Jacobi determinant as a fraction:

$$D = \frac{\;\dfrac{\partial(Q_1 \ldots Q_n, P_1 \ldots P_n)}{\partial(q_1 \ldots q_n, P_1 \ldots P_n)}\;}{\;\dfrac{\partial(q_1 \ldots q_n, p_1 \ldots p_n)}{\partial(q_1 \ldots q_n, P_1 \ldots P_n)}\;} = \frac{\left[\dfrac{\partial(Q_1 \ldots Q_n)}{\partial(q_1 \ldots q_n)}\right]_{P = \text{const.}}}{\left[\dfrac{\partial(p_1 \ldots p_n)}{\partial(P_1 \ldots P_n)}\right]_{q = \text{const.}}}.$$

We assume now the alternative 2 listed under canonical transformations, i.e. $F_2(q_i, P_i)$. We then have $p_i = \partial F_2/\partial q_i$ and $Q_i = \partial F_2/\partial P_i$. Consider the $ik$-element in the numerator:

$$\left( \frac{\partial Q_i}{\partial q_k} \right)_P = \frac{\partial^2 F_2}{\partial q_k\, \partial P_i},$$

and likewise the $ik$-element in the denominator:

$$\left( \frac{\partial p_i}{\partial P_k} \right)_q = \frac{\partial^2 F_2}{\partial P_k\, \partial q_i}.$$

Since a determinant is unchanged under an interchange of rows and columns ($i \leftrightarrow k$), one gets $D = 1$ under a canonical transformation, as advertised. □

We have seen that the time development of the system can be described as a canonical transformation. Since the phase-space volume is conserved under a canonical transformation, it follows that the phase volume occupied by a set of states stays constant as the system evolves in time.
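
Liouville's theorem can be illustrated numerically for a system with one degree of freedom by estimating the Jacobi determinant $D$ of the time-evolution map $(q_0, p_0) \to (q(t), p(t))$. The sketch below is not taken from the notes: it assumes a pendulum with $H = p^2/2 - \cos q$ (units chosen so that all constants are 1), integrates it with the leapfrog scheme (chosen here because each of its sub-steps is itself a volume-preserving shear), and approximates the derivatives by central differences.

```python
import math


def flow(q, p, t_total=5.0, dt=0.01):
    """Evolve (q, p) under H = p^2/2 - cos(q) with kick-drift-kick leapfrog steps."""
    steps = int(round(t_total / dt))
    for _ in range(steps):
        p -= 0.5 * dt * math.sin(q)   # half kick: dp/dt = -dH/dq = -sin(q)
        q += dt * p                   # drift:     dq/dt =  dH/dp = p
        p -= 0.5 * dt * math.sin(q)   # half kick
    return q, p


def jacobi_determinant(q0, p0, h=1e-5):
    """Finite-difference estimate of D = d(Q, P) / d(q, p) for the flow map."""
    q_a, p_a = flow(q0 + h, p0)
    q_b, p_b = flow(q0 - h, p0)
    q_c, p_c = flow(q0, p0 + h)
    q_d, p_d = flow(q0, p0 - h)
    dQ_dq = (q_a - q_b) / (2 * h)
    dP_dq = (p_a - p_b) / (2 * h)
    dQ_dp = (q_c - q_d) / (2 * h)
    dP_dp = (p_c - p_d) / (2 * h)
    return dQ_dq * dP_dp - dQ_dp * dP_dq


print("D =", jacobi_determinant(1.0, 0.5))
```

Replacing the integrator by a non-symplectic one, such as the explicit Euler method, would make $D$ drift away from 1, which is a numerical artifact rather than a property of the Hamiltonian flow.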


C. Hamilton-Jacobi theory 


Canonical transformations can be used to solve mechanical problems. What one wishes to find, by using canonical transformations as a tool, are new coordinates that are cyclic. With cyclic coordinates the integration of the equations of motion (as we have seen several times already) can often be trivial.


One obtains automatically new variables that are constants by requiring that the transformed Hamilton function $K$ should be equal to zero. The canonical equations then become

$$\dot{Q}_i = \frac{\partial K}{\partial P_i} = 0, \qquad \dot{P}_i = -\frac{\partial K}{\partial Q_i} = 0.$$

Here $K$ is related to the generating function $F$ via

$$K = H + \frac{\partial F}{\partial t},$$

which with $K = 0$ gives us

$$H + \frac{\partial F}{\partial t} = 0.$$

It turns out to be convenient to choose the solution $F = F_2 = F_2(q, P, t)$. We make use of the equation (8.5), $p_i = \partial F_2/\partial q_i$, and change the notation by renaming $F_2 \to S$. The result becomes

$$H\!\left(q_1, \ldots, q_n;\ \frac{\partial S}{\partial q_1}, \ldots, \frac{\partial S}{\partial q_n};\ t\right) + \frac{\partial S}{\partial t} = 0, \tag{8.23}$$

which is called the Hamilton-Jacobi equation.


The Hamilton-Jacobi equation is a first order partial differential equation with $n + 1$ variables, namely $q_1, \ldots, q_n$ and $t$, for the generating function $S$. The function $S$ is called Hamilton’s principal function.

We know that the new momenta are constants (we have transformed the problem to a new phase space, i.e. a new canonical set of coordinates):

$$P_i = \alpha_i = \text{const.}$$

We assume a solution of the form

$$S = S(q_1, \ldots, q_n;\ \alpha_1, \ldots, \alpha_{n+1};\ t),$$

where $\alpha_1, \ldots, \alpha_{n+1}$ are, mathematically speaking, $n + 1$ independent constants of integration. Since $S$ itself does not occur in (8.23), $S + \alpha$, with $\alpha$ an arbitrary constant, will also be a solution. This additive constant has to correspond to one of the $n + 1$ constants of integration, and is of no physical significance since $S$ only occurs in (8.23) in the form of derivatives. For our purpose it is sufficient to write the solution in the form

$$S = S(q_1, \ldots, q_n;\ \alpha_1, \ldots, \alpha_n;\ t).$$

We can now choose the $n$ integration constants equal to the new constant momenta $P_i = \alpha_i$. We thus have

$$p_i = \frac{\partial}{\partial q_i}S(q, \alpha, t) \tag{8.24}$$

as the first part of the transformation equations. The second half of the transformation equations between old and new coordinates, (8.5), is

$$Q_i = \frac{\partial}{\partial \alpha_i}S(q, \alpha, t) = \beta_i. \tag{8.25}$$

We assume finally that this equation is solved with respect to the original coordinates, so that

$$q_i = q_i(\alpha, \beta, t). \tag{8.26}$$

When viewed mathematically, these equations demonstrate the equivalence between 

• the equations with which we started, namely 2 n canonical equations of motion (first order differential equations), and 

• the first order partial Hamilton-Jacobi differential equation. 


General procedure of solution 


Let us summarize the general procedure of solution given above:

1. From a known Hamilton function $H$, we construct the Hamilton-Jacobi equation,

$$H\!\left(q, \frac{\partial S}{\partial q}, t\right) + \frac{\partial S}{\partial t} = 0,$$

and find the solution $S = S(q, \alpha, t)$ for Hamilton’s principal function $S$.

2. Thereafter we find

$$Q_i = \frac{\partial}{\partial \alpha_i}S(q, \alpha, t) = \beta_i,$$

and from that again

$$q_i = q_i(\alpha, \beta, t).$$

3. Finally we find the original momenta,

$$p_i = \frac{\partial}{\partial q_i}S(q, \alpha, t).$$


A simplification can be made here if $H$ does not contain $t$ explicitly (conservative system). Then we can write

$$S(q, \alpha, t) = W(q, \alpha) - \alpha t.$$

The equation

$$H + \frac{\partial S}{\partial t} = 0$$

means that

$$H\!\left(q, \frac{\partial W}{\partial q}\right) - \alpha = 0.$$

Thus, for a conservative system,

$$H = E = \alpha = \text{total energy}, \qquad S(q, \alpha, t) = W(q, \alpha) - Et. \tag{8.27}$$

Conclusion: for a conservative system the Hamilton-Jacobi equation (8.23) yields that

$$H(q, p) = E, \tag{8.28}$$

with

$$p_i = \frac{\partial S}{\partial q_i} = \frac{\partial W}{\partial q_i}. \tag{8.29}$$

Here $W$ is called Hamilton’s characteristic function.


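Example 24. Harmonic oscillator. Consider the Hamilton function for the harmonic oscillator:

$$H = \frac{p^2}{2m} + \frac{1}{2}kq^2.$$

With $p = \partial S/\partial q$ the condition $H + \partial S/\partial t = 0$ gives that

$$\frac{1}{2m}\left( \frac{\partial S}{\partial q} \right)^2 + \frac{1}{2}kq^2 + \frac{\partial S}{\partial t} = 0.$$

Since $H$ does not depend on time we have $S(q, \alpha, t) = W(q, \alpha) - \alpha t$, implying

$$\frac{1}{2m}\left( \frac{\partial W}{\partial q} \right)^2 + \frac{1}{2}kq^2 = \alpha,$$

$$W = \sqrt{mk}\int dq\,\sqrt{\frac{2\alpha}{k} - q^2}, \qquad S = \sqrt{mk}\int dq\,\sqrt{\frac{2\alpha}{k} - q^2} - \alpha t,$$

which is Hamilton’s principal function. Of physical importance are only the partial derivatives of $S$. As in the text above we define

$$\beta' = \frac{\partial S}{\partial \alpha} = \sqrt{\frac{m}{k}}\int \frac{dq}{\sqrt{\frac{2\alpha}{k} - q^2}} - t,$$

so that

$$t + \beta' = \sqrt{\frac{m}{k}}\,\arcsin\!\left( q\sqrt{\frac{k}{2\alpha}} \right), \qquad q = \sqrt{\frac{2\alpha}{k}}\,\sin(\omega t + \beta),$$

where $\beta = \beta'\omega$, $\omega = \sqrt{k/m}$ and $\alpha = E$.

Consider finally the momentum:

$$p = \frac{\partial S}{\partial q} = \sqrt{mk}\,\sqrt{\frac{2\alpha}{k} - q^2} = \sqrt{mk}\,\sqrt{\frac{2\alpha}{k} - \frac{2\alpha}{k}\sin^2(\omega t + \beta)},$$

$$p = \sqrt{2m\alpha}\,\cos(\omega t + \beta).$$

We see that this agrees with the relation $p = m\dot{q}$.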
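
The result of this example can be checked numerically. The sketch below is illustrative only: the parameter values and the function name are arbitrary assumptions. It evaluates $q(t)$ and $p(t)$ from the formulas above, confirms that $H(q, p)$ equals the constant $\alpha$, and compares $p$ with $m\dot{q}$ estimated by a central difference.

```python
import math


def hj_solution(t, m, k, alpha, beta):
    """q(t) and p(t) for the harmonic oscillator as obtained from Hamilton-Jacobi theory."""
    omega = math.sqrt(k / m)
    q = math.sqrt(2.0 * alpha / k) * math.sin(omega * t + beta)
    p = math.sqrt(2.0 * m * alpha) * math.cos(omega * t + beta)
    return q, p


# Arbitrary illustrative parameters.
m, k, alpha, beta = 1.5, 4.0, 2.0, 0.3
t, h = 0.8, 1e-6

q, p = hj_solution(t, m, k, alpha, beta)
energy = p * p / (2.0 * m) + 0.5 * k * q * q

q_plus, _ = hj_solution(t + h, m, k, alpha, beta)
q_minus, _ = hj_solution(t - h, m, k, alpha, beta)
m_qdot = m * (q_plus - q_minus) / (2.0 * h)

print("H(q, p) =", energy, " alpha =", alpha)
print("p =", p, " m * dq/dt =", m_qdot)
```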